This notebook explores how collaborative relationships form between mailing list participants over time.

The hypothesis, loosely put, is that early exchanges are indicators of growing relationships or trust that should be reflected in information flow at later times.


In [1]:
%matplotlib inline

Next we'll import dependencies.


In [2]:
from bigbang.archive import Archive
import bigbang.parse as parse
import bigbang.graph as graph
import bigbang.mailman as mailman
import bigbang.process as process
import bigbang.twopeople as twoppl
import matplotlib.pyplot as plt
import networkx as nx
import numpy as np
import pandas as pd
from pprint import pprint as pp
import networkx as nx
import pytz
import math

Let's begin with just one mailing list to simplify.


In [3]:
url = "http://mail.scipy.org/pipermail/scipy-dev/"
arx= Archive(url,archive_dir="../archives")

In [4]:
arx.data.irow(0).Body


Out[4]:
'The link:\n\nhttp://scipy.net/mailman/listinfo/scipy-dev\n\nyou may email to scipy-dev at scipy.net to post to the list.'

In [5]:
arx.data.shape


Out[5]:
(20405, 6)

In [6]:
arx.data


Out[6]:
From Subject Date In-Reply-To References Body
Message-ID
<NEBBIECAMLMAAKHEGPCGKEBHCLAA.travis@vaught.net> travis at vaught.net (Travis N. Vaught) [SciPy-dev] SciPy Developer mailing list now o... 2001-06-11 02:10:51 None None The link:\n\nhttp://scipy.net/mailman/listinfo...
<Pine.LNX.4.33.0107231957590.15960-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] RPMs and source distribution 2001-07-24 02:01:00 <02f001c111bf$2e78a9d0$777ba8c0@190xb01> None I've been playing for hours and finally have i...
<87snfmn8o1.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-07-24 06:27:42 <Pine.LNX.4.33.0107231957590.15960-100000@olip... <Pine.LNX.4.33.0107231957590.15960-100000@olip... >>>>> "TO" == Travis Oliphant <oliphant at ee....
<Pine.LNX.4.33.0107240041500.16169-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] RPMs and source distribution 2001-07-24 06:46:36 <87snfmn8o1.fsf@jeeves.blindglobe.net> None > >>>>> "TO" == Travis Oliphant <oliphant at e...
<87itgisti1.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-07-24 06:58:46 <Pine.LNX.4.33.0107240041500.16169-100000@olip... <Pine.LNX.4.33.0107240041500.16169-100000@olip... >>>>> "TO" == Travis Oliphant <oliphant at ee....
<20010725005337.PHSJ11181.tomts14-srv.bellnexxia.net@localhost> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Building SciPy on Mac OS X 2001-07-25 00:53:00 None None All,\n\nI've decide that I'm going to tackle g...
<20010725034411.SATM11181.tomts14-srv.bellnexxia.net@localhost> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Building SciPy on Mac OS X Part II 2001-07-25 03:43:46 None None All,\n\nAs I mentioned in my previous message,...
<20010801231139.YWP4521.tomts13-srv.bellnexxia.net@localhost> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Splines in SciPy 2001-08-01 23:09:41 None None All,\n\nGetting the new version of SciPy from ...
<024e01c11b4e$f5510050$7600a8c0@ericlaptop> eric at scipy.org (eric jones) Fw: [SciPy-dev] Splines in SciPy 2001-08-02 12:30:42 None None Hey Tim,\n\n I just noticed this addition too....
<Pine.LNX.4.33.0108021551360.838-100000@sugar.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] Spline module 2001-08-02 21:55:24 <200108021700.f72H03D17694@scipy.org> None >\n> Getting the new version of SciPy from CVS...
<00e601c11bc5$0dbea630$c300a8c0@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] Spline module 2001-08-03 02:36:09 None <Pine.LNX.4.33.0108021551360.838-100000@sugar.... I'm no splines expert, but I'd hope there was ...
<200108031856.OAA29753@mud.cgl.uwaterloo.ca> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Re: Splines module 2001-08-03 18:56:28 <200108031700.f73H01D31118@scipy.org> from "sc... None > \n> Date: Thu, 2 Aug 2001 15:55:24 -0600 (MD...
<005101c12a92$ce8e2b20$ec88fea9@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] RPMs and source distribution 2001-08-21 22:44:15 None <Pine.LNX.4.33.0107231957590.15960-100000@olip... Hey Joe,\n\nDebian packages would be good. Yo...
<87u1z1j90m.fsf@phantom.ecn.uiowa.edu> jmr at engineering.uiowa.edu (Joe Reinhardt) [SciPy-dev] RPMs and source distribution 2001-08-21 23:19:53 <87snfmn8o1.fsf@jeeves.blindglobe.net>\n (ross... <Pine.LNX.4.33.0107231957590.15960-100000@olip... Did this debian package ever materialize? \n...
<009f01c12aaa$3bee89a0$ec88fea9@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] AIX isinf, isfinite, etc. 2001-08-22 01:31:57 None None The good news:\n\nAlmost all of scipy now comp...
<NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought.com> travis at scipy.org (Travis N. Vaught) [SciPy-dev] RPMs and source distribution 2001-08-22 03:44:22 <87u1z1j90m.fsf@phantom.ecn.uiowa.edu> None We haven't heard anything yet about a working ...
<87u1z01ef6.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-08-22 12:12:45 <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... >>>>> "TNV" == Travis N Vaught <travis at scip...
<lg13d6klyt3.fsf@pet.ecn.uiowa.edu> jmr at engineering.uiowa.edu (Joe Reinhardt) [SciPy-dev] RPMs and source distribution 2001-08-22 18:44:24 <87u1z01ef6.fsf@jeeves.blindglobe.net>\n (ross... <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... rossini at blindglobe.net (A.J. Rossini) write...
<20010902195926.A20656@myrddin.caltech.edu> kern at caltech.edu (Robert Kern) [SciPy-dev] LAPACK is not thread-safe (AFAICT) 2001-09-03 02:59:26 None None Disclaimer: I'm not a FORTRAN programmer or a ...
<3B93B249.A3E8B37D@home.com> europax at home.com (Rob) [SciPy-dev] documentation 2001-09-03 16:39:38 None None Is there a non-html version of the documentati...
<00c101c1370e$f6758010$c100a8c0@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] scipy.linalg and LAPACK 3.0 2001-09-06 20:03:13 None <20010906133610.A18337@myrddin.caltech.edu> I'd say the latter of the two. I started lina...
<20010906133610.A18337@myrddin.caltech.edu> kern at caltech.edu (Robert Kern) [SciPy-dev] scipy.linalg and LAPACK 3.0 2001-09-06 20:36:10 None None Hi,\n\nSome of the routines wrapped in generic...
<86d74cr9q2.fsf@bock.chem.unc.edu> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... [SciPy-dev] SciPy on Cygwin 2001-09-27 14:26:13 None None I am trying to compile SciPy (cvs) on latest C...
<Pine.LNX.4.33.0109271117580.25675-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 17:19:26 <200109271700.f8RH02U04182@scipy.org> None > 1. SciPy on Cygwin (Jochen =?iso-8859-1?q...
<Pine.LNX.4.21.0109271959570.22161-100000@cens.ioc.ee> pearu at cens.ioc.ee (Pearu Peterson) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 18:10:57 <Pine.LNX.4.33.0109271117580.25675-100000@olip... None On Thu, 27 Sep 2001, Travis Oliphant wrote:\n\...
<Pine.LNX.4.21.0109272015040.22161-100000@cens.ioc.ee> pearu at cens.ioc.ee (Pearu Peterson) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 18:16:32 <Pine.LNX.4.21.0109271959570.22161-100000@cens... None On Thu, 27 Sep 2001, Pearu Peterson wrote:\n\n...
<20010927124852.A18567@myrddin.caltech.edu> kern at caltech.edu (Robert Kern) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 19:48:52 <Pine.LNX.4.21.0109271959570.22161-100000@cens... <Pine.LNX.4.33.0109271117580.25675-100000@olip... On Thu, Sep 27, 2001 at 08:10:57PM +0200, Pear...
<20010927164848.Q1356@dothill.com> jason at tishler.net (Jason Tishler) [SciPy-dev] Re: SciPy on Cygwin 2001-09-27 20:48:48 <86d74cr9q2.fsf@bock.chem.unc.edu> None Jochen,\n\nOn Thu, Sep 27, 2001 at 10:26:13AM ...
<86g097pgqt.fsf@bock.chem.unc.edu> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] SciPy on Cygwin 2001-09-28 14:48:13 <86d74cr9q2.fsf@bock.chem.unc.edu> <86d74cr9q2.fsf@bock.chem.unc.edu> A little further, but now I am having linking ...
<20010928135700.X1356@dothill.com> jason at tishler.net (Jason Tishler) [SciPy-dev] SciPy on Cygwin 2001-09-28 17:57:00 <86g097pgqt.fsf@bock.chem.unc.edu> None Jochen,\n\nOn Fri, Sep 28, 2001 at 10:48:13AM ...
<Pine.LNX.4.21.0109281946030.31117-100000@cens.ioc.ee> pearu at cens.ioc.ee (Pearu Peterson) [SciPy-dev] SciPy on Cygwin 2001-09-28 18:03:10 <86g097pgqt.fsf@bock.chem.unc.edu> None On 28 Sep 2001, Jochen K?pper wrote:\n\n> Runn...
<20010930142118.16379.qmail@lisboa.ifm.uni-kiel.de> jhauser at ifm.uni-kiel.de (Janko Hauser) [SciPy-dev] Documentation and then some 2001-09-30 14:21:18 None None Hi, I saw the notice for the new documentation...
<5.1.0.14.2.20010930132153.00a7beb8@pop.fas.harvard.edu> wsryu at fas.harvard.edu (William Ryu) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #14 - ... 2001-09-30 17:25:03 <200109301701.f8UH11U00314@scipy.org> None At 12:01 PM 9/30/2001 -0500, you wrote:\n>From...
<20010930214531.16525.qmail@lisboa.ifm.uni-kiel.de> jhauser at ifm.uni-kiel.de (Janko Hauser) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #14 - ... 2001-09-30 21:45:31 <5.1.0.14.2.20010930132153.00a7beb8@pop.fas.ha... <200109301701.f8UH11U00314@scipy.org> <5.1.0.1... Sorry for the last mail if it actually came th...
<wkofnegqee.fsf@lee-morgan.net> python at lee-morgan.net (python at lee-morgan... [SciPy-dev] scipy plt can only plot once? 2001-10-11 15:14:33 None None After investigation its due to a division erro...
<KPEIKEPEIBGNGNKHDILIEEEPCEAA.travis@enthought.com> travis at scipy.org (Travis N. Vaught) [SciPy-dev] scipy plt can only plot once? 2001-10-11 15:22:01 <wkofnegqee.fsf@lee-morgan.net> None plt.figure() should create another wxFrame ins...
<DGEHJAIMAIENCEHBIHCJAEGACIAA.altis@semi-retired.com> altis at semi-retired.com (Kevin Altis) [SciPy-dev] scipy plt can only plot once? 2001-10-11 17:29:01 <KPEIKEPEIBGNGNKHDILIEEEPCEAA.travis@enthought... None I'm new to scipy, but I'm trying out the tutor...
<wk669lmllp.fsf@lee-morgan.net> python at lee-morgan.net (python at lee-morgan... [SciPy-dev] scipy plt can only plot once? 2001-10-12 06:13:54 None None > \n>Travis N. Vaught\n>\n>plt.figure() shoul...
<01eb01c15349$14069be0$c100a8c0@ericlaptop> ej at ee.duke.edu (eric jones) [SciPy-dev] scipy plt can only plot once? 2001-10-12 18:09:46 None <DGEHJAIMAIENCEHBIHCJAEGACIAA.altis@semi-retir... ----- Original Message -----\nFrom: "Kevin Alt...
<DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retired.com> altis at semi-retired.com (Kevin Altis) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-15 17:39:46 None None I asked a number of friends involved in scienc...
<86r8rzcy55.fsf@bock.chem.unc.edu> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... [SciPy-dev] cvs problems? 2001-10-19 17:55:50 None None -----BEGIN PGP SIGNED MESSAGE-----\nHash: SHA1...
<m34rottvkf.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] plt 2001-10-21 05:24:00 None None -----BEGIN PGP SIGNED MESSAGE-----\nHash: SHA1...
<Pine.LNX.4.10.10110211647340.14279-100000@pc103.maths.bris.ac.uk> a.backer at bristol.ac.uk (Arnd Baecker) [SciPy-dev] Link problem with scipy on SUN ? 2001-10-21 20:49:42 None None Hi,\n\nwhen trying to install scipy I ran into...
<033101c15bd7$82d0e8f0$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) [SciPy-dev] plt 2001-10-23 15:29:26 None <m34rottvkf.fsf@box.home.de> Hey Jochen,\n\n> Is there a good reason not to...
<033501c15bd7$87062e30$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) [SciPy-dev] cvs problems? 2001-10-23 15:29:38 None <86r8rzcy55.fsf@bock.chem.unc.edu> Sorry bout that. Some files were added and gr...
<033e01c15bdb$b5c38660$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) [SciPy-dev] Link problem with scipy on SUN ? 2001-10-23 15:59:34 None <Pine.LNX.4.10.10110211647340.14279-100000@pc1... Hey Arnd,\n\nShort answer:\n\nSun has only bee...
<03a801c15be9$a77e3ba0$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) Fw: [SciPy-dev] Link problem with scipy on SUN ? 2001-10-23 17:39:15 None <Pine.LNX.4.10.10110231837420.17942-100000@pc1... Hey,\n\nSorry that was a typo.\n\nI meant\n\n ...
<m37ktl3kfu.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] cvs problems? 2001-10-24 01:17:57 <033501c15bd7$87062e30$777ba8c0@ericlaptop> <86r8rzcy55.fsf@bock.chem.unc.edu>\n\t<033501c... -----BEGIN PGP SIGNED MESSAGE-----\nHash: SHA1...
<m3k7xl1udk.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] wxplt 2001-10-24 05:26:15 None None Patch against current cvs, cleaning up wildcar...
<86zo6h2ilo.fsf@bock.chem.unc.edu> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... [SciPy-dev] PlotCanvas 2001-10-24 15:13:50 None None A non-text attachment was scrubbed...\nName: p...
<000b01c15cd1$8755c530$6600a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] PlotCanvas 2001-10-24 21:19:03 None <86zo6h2ilo.fsf@bock.chem.unc.edu> Hey Jochen,\n\nThe figure() method is actually...
<m3pu7c2z8g.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 03:08:15 <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... Kevin, All,\n\njust stumbled over this. Maybe ...
<m3elns2vkw.fsf_-_@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] Re: PlotCanvas 2001-10-25 04:27:11 <000b01c15cd1$8755c530$6600a8c0@ericlaptop> <86zo6h2ilo.fsf@bock.chem.unc.edu>\n\t<000b01c... On Wed, 24 Oct 2001 17:19:03 -0400 eric jones ...
<004701c15d55$d889d700$c400a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] reading ascii files into arrays 2001-10-25 13:06:22 None <18646.1003998017@www7.gmx.net> > Hi Eric !\n>\n> I have decided to use python...
<87vgh3lvh8.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 13:06:43 <m3pu7c2z8g.fsf@box.home.de> <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... >>>>> "Jochen" == Jochen K?pper <jochen at joc...
<005e01c15d58$d09845b0$c400a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] Re: PlotCanvas 2001-10-25 13:27:37 None <86zo6h2ilo.fsf@bock.chem.unc.edu><000b01c15cd... > Not exactly. What I really need is to tell p...
<007501c15d5b$66ae9890$c400a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 13:46:08 None <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... I'm not from the camp of "Excel bad, Command l...
<Pine.LNX.4.33L2.0110251154480.26035-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] Reading in arrays. 2001-10-25 16:03:25 <200110251607.f9PG7E526198@scipy.org> None As eric pointed out reading arrays of ASCII da...
<DGEHJAIMAIENCEHBIHCJCEHHCJAA.altis@semi-retired.com> altis at semi-retired.com (Kevin Altis) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 16:08:00 <87vgh3lvh8.fsf@jeeves.blindglobe.net> None Don't shoot the messenger. :) My purpose in br...
<00f501c15d82$124380f0$c400a8c0@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] Reading in arrays. 2001-10-25 18:22:57 None <Pine.LNX.4.33L2.0110251154480.26035-100000@ol... > As eric pointed out reading arrays of ASCII ...
... ... ... ... ... ...

20405 rows × 6 columns

Let's look at the matrix of who replies to whom over the whole history of the list, to get a sense of the overall distribution


In [7]:
arx.data[arx.data['In-Reply-To'] > 0][:10]


Out[7]:
From Subject Date In-Reply-To References Body
Message-ID
<Pine.LNX.4.33.0107231957590.15960-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] RPMs and source distribution 2001-07-24 02:01:00 <02f001c111bf$2e78a9d0$777ba8c0@190xb01> None I've been playing for hours and finally have i...
<87snfmn8o1.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-07-24 06:27:42 <Pine.LNX.4.33.0107231957590.15960-100000@olip... <Pine.LNX.4.33.0107231957590.15960-100000@olip... >>>>> "TO" == Travis Oliphant <oliphant at ee....
<Pine.LNX.4.33.0107240041500.16169-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] RPMs and source distribution 2001-07-24 06:46:36 <87snfmn8o1.fsf@jeeves.blindglobe.net> None > >>>>> "TO" == Travis Oliphant <oliphant at e...
<87itgisti1.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-07-24 06:58:46 <Pine.LNX.4.33.0107240041500.16169-100000@olip... <Pine.LNX.4.33.0107240041500.16169-100000@olip... >>>>> "TO" == Travis Oliphant <oliphant at ee....
<Pine.LNX.4.33.0108021551360.838-100000@sugar.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] Spline module 2001-08-02 21:55:24 <200108021700.f72H03D17694@scipy.org> None >\n> Getting the new version of SciPy from CVS...
<200108031856.OAA29753@mud.cgl.uwaterloo.ca> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Re: Splines module 2001-08-03 18:56:28 <200108031700.f73H01D31118@scipy.org> from "sc... None > \n> Date: Thu, 2 Aug 2001 15:55:24 -0600 (MD...
<87u1z1j90m.fsf@phantom.ecn.uiowa.edu> jmr at engineering.uiowa.edu (Joe Reinhardt) [SciPy-dev] RPMs and source distribution 2001-08-21 23:19:53 <87snfmn8o1.fsf@jeeves.blindglobe.net>\n (ross... <Pine.LNX.4.33.0107231957590.15960-100000@olip... Did this debian package ever materialize? \n...
<NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought.com> travis at scipy.org (Travis N. Vaught) [SciPy-dev] RPMs and source distribution 2001-08-22 03:44:22 <87u1z1j90m.fsf@phantom.ecn.uiowa.edu> None We haven't heard anything yet about a working ...
<87u1z01ef6.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-08-22 12:12:45 <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... >>>>> "TNV" == Travis N Vaught <travis at scip...
<lg13d6klyt3.fsf@pet.ecn.uiowa.edu> jmr at engineering.uiowa.edu (Joe Reinhardt) [SciPy-dev] RPMs and source distribution 2001-08-22 18:44:24 <87u1z01ef6.fsf@jeeves.blindglobe.net>\n (ross... <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... rossini at blindglobe.net (A.J. Rossini) write...

10 rows × 6 columns

In particular we are interested in who replied to who at each time. Recall that this is an open mailing list--everybody potentially reads each message. A response from A to B is an indication that A read B's original message. Therefore, a response indicates not just a single message from A to B, but an exchange from B to A and back again.

Below we modify our data to see who replied to whom.


In [8]:
messages = arx.data[['From']]

responses = arx.data[arx.data['In-Reply-To'] > 0][['From','Date','In-Reply-To']]

exchanges = pd.merge(messages,responses,how='inner',right_on='In-Reply-To',left_index=True,suffixes=['_original','_response'])

exchanges


Out[8]:
From_original From_response Date In-Reply-To
Message-ID
<87snfmn8o1.fsf@jeeves.blindglobe.net> oliphant at ee.byu.edu (Travis Oliphant) rossini at blindglobe.net (A.J. Rossini) 2001-07-24 06:27:42 <Pine.LNX.4.33.0107231957590.15960-100000@olip...
<Pine.LNX.4.33.0107240041500.16169-100000@oliphant.ee.byu.edu> rossini at blindglobe.net (A.J. Rossini) oliphant at ee.byu.edu (Travis Oliphant) 2001-07-24 06:46:36 <87snfmn8o1.fsf@jeeves.blindglobe.net>
<87itgisti1.fsf@jeeves.blindglobe.net> oliphant at ee.byu.edu (Travis Oliphant) rossini at blindglobe.net (A.J. Rossini) 2001-07-24 06:58:46 <Pine.LNX.4.33.0107240041500.16169-100000@olip...
<NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought.com> jmr at engineering.uiowa.edu (Joe Reinhardt) travis at scipy.org (Travis N. Vaught) 2001-08-22 03:44:22 <87u1z1j90m.fsf@phantom.ecn.uiowa.edu>
<87u1z01ef6.fsf@jeeves.blindglobe.net> travis at scipy.org (Travis N. Vaught) rossini at blindglobe.net (A.J. Rossini) 2001-08-22 12:12:45 <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought...
<20010927164848.Q1356@dothill.com> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... jason at tishler.net (Jason Tishler) 2001-09-27 20:48:48 <86d74cr9q2.fsf@bock.chem.unc.edu>
<86g097pgqt.fsf@bock.chem.unc.edu> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... jochen at jochen-kuepper.de (Jochen =?iso-8859... 2001-09-28 14:48:13 <86d74cr9q2.fsf@bock.chem.unc.edu>
<Pine.LNX.4.21.0109271959570.22161-100000@cens.ioc.ee> oliphant at ee.byu.edu (Travis Oliphant) pearu at cens.ioc.ee (Pearu Peterson) 2001-09-27 18:10:57 <Pine.LNX.4.33.0109271117580.25675-100000@olip...
<Pine.LNX.4.21.0109272015040.22161-100000@cens.ioc.ee> pearu at cens.ioc.ee (Pearu Peterson) pearu at cens.ioc.ee (Pearu Peterson) 2001-09-27 18:16:32 <Pine.LNX.4.21.0109271959570.22161-100000@cens...
<20010927124852.A18567@myrddin.caltech.edu> pearu at cens.ioc.ee (Pearu Peterson) kern at caltech.edu (Robert Kern) 2001-09-27 19:48:52 <Pine.LNX.4.21.0109271959570.22161-100000@cens...
<20010928135700.X1356@dothill.com> jochen at jochen-kuepper.de (Jochen =?iso-8859... jason at tishler.net (Jason Tishler) 2001-09-28 17:57:00 <86g097pgqt.fsf@bock.chem.unc.edu>
<Pine.LNX.4.21.0109281946030.31117-100000@cens.ioc.ee> jochen at jochen-kuepper.de (Jochen =?iso-8859... pearu at cens.ioc.ee (Pearu Peterson) 2001-09-28 18:03:10 <86g097pgqt.fsf@bock.chem.unc.edu>
<20010930214531.16525.qmail@lisboa.ifm.uni-kiel.de> wsryu at fas.harvard.edu (William Ryu) jhauser at ifm.uni-kiel.de (Janko Hauser) 2001-09-30 21:45:31 <5.1.0.14.2.20010930132153.00a7beb8@pop.fas.ha...
<KPEIKEPEIBGNGNKHDILIEEEPCEAA.travis@enthought.com> python at lee-morgan.net (python at lee-morgan... travis at scipy.org (Travis N. Vaught) 2001-10-11 15:22:01 <wkofnegqee.fsf@lee-morgan.net>
<DGEHJAIMAIENCEHBIHCJAEGACIAA.altis@semi-retired.com> travis at scipy.org (Travis N. Vaught) altis at semi-retired.com (Kevin Altis) 2001-10-11 17:29:01 <KPEIKEPEIBGNGNKHDILIEEEPCEAA.travis@enthought...
<m3pu7c2z8g.fsf@box.home.de> altis at semi-retired.com (Kevin Altis) jochen at jochen-kuepper.de (Jochen =?iso-8859... 2001-10-25 03:08:15 <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir...
<m37ktl3kfu.fsf@box.home.de> ejones17 at austin.rr.com (ERIC JONES) jochen at jochen-kuepper.de (Jochen =?iso-8859... 2001-10-24 01:17:57 <033501c15bd7$87062e30$777ba8c0@ericlaptop>
<m3elns2vkw.fsf_-_@box.home.de> ejones17 at austin.rr.com (eric jones) jochen at jochen-kuepper.de (Jochen =?iso-8859... 2001-10-25 04:27:11 <000b01c15cd1$8755c530$6600a8c0@ericlaptop>
<87vgh3lvh8.fsf@jeeves.blindglobe.net> jochen at jochen-kuepper.de (Jochen =?iso-8859... rossini at blindglobe.net (A.J. Rossini) 2001-10-25 13:06:43 <m3pu7c2z8g.fsf@box.home.de>
<DGEHJAIMAIENCEHBIHCJCEHHCJAA.altis@semi-retired.com> rossini at blindglobe.net (A.J. Rossini) altis at semi-retired.com (Kevin Altis) 2001-10-25 16:08:00 <87vgh3lvh8.fsf@jeeves.blindglobe.net>
<m3zo6fulnt.fsf@box.home.de> ejones17 at austin.rr.com (eric jones) jochen at jochen-kuepper.de (Jochen =?iso-8859... 2001-10-26 03:26:14 <005e01c15d58$d09845b0$c400a8c0@ericlaptop>
<86elnqp57e.fsf@bock.chem.unc.edu> eric at scipy.org (eric jones) jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... 2001-10-26 19:32:53 <028501c15e32$b479c720$c400a8c0@ericlaptop>
<KPEIKEPEIBGNGNKHDILIEEACCFAA.travis@enthought.com> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... travis at scipy.org (Travis N. Vaught) 2001-10-29 21:00:09 <868zdu4219.fsf@bock.chem.unc.edu>
<86snc22gth.fsf@bock.chem.unc.edu> travis at scipy.org (Travis N. Vaught) jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... 2001-10-29 22:59:38 <KPEIKEPEIBGNGNKHDILIEEACCFAA.travis@enthought...
<87zo66mpir.fsf@jeeves.blindglobe.net> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... rossini at blindglobe.net (A.J. Rossini) 2001-11-01 16:20:28 <864roeija9.fsf@bock.chem.unc.edu>
<87u1w9qhy9.fsf@jeeves.blindglobe.net> eric at scipy.org (eric) rossini at blindglobe.net (A.J. Rossini) 2001-11-05 16:51:42 <060301c16609$ec43a700$777ba8c0@ericlaptop>
<Pine.GSO.3.96.1011105125110.2400K-100000@bucky.nmr.mgh.harvard.edu> eric at scipy.org (eric) strang at nmr.mgh.harvard.edu (Gary Strangman) 2001-11-05 18:15:09 <060301c16609$ec43a700$777ba8c0@ericlaptop>
<5.1.0.14.0.20011106122104.02e93420@localhost> eric at scipy.org (eric) magnus at thinkware.se (Magnus =?iso-8859-1?Q?... 2001-11-06 11:40:53 <062f01c1660b$87b2f410$777ba8c0@ericlaptop>
<5.1.0.14.0.20011109121855.0211c178@localhost> eric at scipy.org (eric) magnus at thinkware.se (Magnus =?iso-8859-1?Q?... 2001-11-09 12:35:46 <086c01c166d4$7d8e9240$777ba8c0@ericlaptop>
<5.1.0.14.0.20011109175409.02126880@localhost> eric at scipy.org (eric) magnus at thinkware.se (Magnus =?iso-8859-1?Q?... 2001-11-09 17:12:30 <012901c16930$ee3d5c70$c300a8c0@ericlaptop>
<86wv0zsml4.fsf@bock.chem.unc.edu> eric at scipy.org (eric) jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... 2001-11-09 20:42:15 <012901c16930$ee3d5c70$c300a8c0@ericlaptop>
<m3eln7pe25.fsf@box.home.de> rlytle at tqs.com (Lytle, Robert TQO) jochen at jochen-kuepper.de (Jochen =?iso-8859... 2001-11-10 02:14:42 <81E1D2E15CCBD311A74700A0C9E1CC8E03981C4F@chun...
<m3r8qjy8jb.fsf@box.home.de> arnd.baecker at physik.uni-ulm.de (arnd.baecke... jochen at jochen-kuepper.de (Jochen =?iso-8859... 2001-11-28 01:43:36 <Pine.SOL.4.41.0111211110060.27853-100000@wega...
<87itb3bwla.fsf@jeeves.blindglobe.net> jmr at engineering.uiowa.edu (Joe Reinhardt) rossini at blindglobe.net (A.J. Rossini) 2001-12-19 05:36:01 <87itb4huyn.fsf_-_@phantom.ecn.uiowa.edu>
<15411.52223.166321.116325@monster.linux.in> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-03 03:11:59 <039701c193e1$4a384190$777ba8c0@ericlaptop>
<15412.1938.232288.17357@cyberwaveindia.com> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-03 07:26:10 <03c001c19416$bf157390$777ba8c0@ericlaptop>
<Pine.OSF.3.96.1020103022918.9936A-100000@longs.colorado.edu> eric at scipy.org (eric) fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-03 09:31:24 <03c001c19416$bf157390$777ba8c0@ericlaptop>
<Pine.OSF.3.96.1020104161620.14988A-100000@longs.colorado.edu> eric at scipy.org (eric) fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-04 23:16:40 <062601c19569$984f67d0$777ba8c0@ericlaptop>
<Pine.OSF.3.96.1020104164223.15246A-100000@longs.colorado.edu> eric at scipy.org (eric) fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-04 23:43:13 <062601c19569$984f67d0$777ba8c0@ericlaptop>
<15415.15280.164483.718499@monster.linux.in> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-05 17:45:20 <062601c19569$984f67d0$777ba8c0@ericlaptop>
<Pine.OSF.3.96.1020104171442.14666A-100000@longs.colorado.edu> eric at scipy.org (eric) fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-05 00:30:21 <066501c19574$f00970a0$777ba8c0@ericlaptop>
<KPEIKEPEIBGNGNKHDILIGECLCNAA.travis@enthought.com> fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... travis at scipy.org (Travis N. Vaught) 2002-01-05 03:58:46 <Pine.OSF.3.96.1020104161620.14988A-100000@lon...
<Pine.OSF.3.96.1020105012243.18316A-100000@longs.colorado.edu> eric at scipy.org (eric) fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-05 08:31:28 <069d01c195a3$e63d7600$777ba8c0@ericlaptop>
<Pine.OSF.3.96.1020105015143.18316B-100000@longs.colorado.edu> eric at scipy.org (eric) fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-05 08:52:05 <069d01c195a3$e63d7600$777ba8c0@ericlaptop>
<Pine.LNX.4.21.0201051039390.31153-100000@cens.ioc.ee> fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... pearu at cens.ioc.ee (Pearu Peterson) 2002-01-05 08:53:43 <Pine.OSF.3.96.1020105012243.18316A-100000@lon...
<15415.15570.95637.888919@monster.linux.in> pearu at cens.ioc.ee (Pearu Peterson) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-05 17:50:10 <Pine.LNX.4.21.0201051408420.2425-100000@cens....
<KPEIKEPEIBGNGNKHDILIMECPCNAA.travis@enthought.com> rob at pythonemproject.com (Rob) travis at scipy.org (Travis N. Vaught) 2002-01-05 17:08:02 <3C3724F3.27F0C8EF@pythonemproject.com>
<15415.17841.136421.215483@monster.linux.in> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-05 18:28:01 <071f01c1960c$83a11330$777ba8c0@ericlaptop>
<Pine.LNX.4.21.0201052009490.8831-100000@cens.ioc.ee> prabhu at aero.iitm.ernet.in (Prabhu Ramachand... pearu at cens.ioc.ee (Pearu Peterson) 2002-01-05 18:22:51 <15415.15570.95637.888919@monster.linux.in>
<15417.3794.691758.303749@monster.linux.in> pearu at cens.ioc.ee (Pearu Peterson) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-07 02:58:26 <Pine.LNX.4.21.0201052009490.8831-100000@cens....
<Pine.LNX.4.21.0201070953120.25046-100000@cens.ioc.ee> prabhu at aero.iitm.ernet.in (Prabhu Ramachand... pearu at cens.ioc.ee (Pearu Peterson) 2002-01-07 07:59:46 <15417.3794.691758.303749@monster.linux.in>
<15417.24570.448897.692367@monster.linux.in> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-07 08:44:42 <097101c19747$6026c3f0$777ba8c0@ericlaptop>
<15417.51205.897078.969051@monster.linux.in> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-07 16:08:37 <09f801c1978a$51ecd0c0$777ba8c0@ericlaptop>
<Pine.OSF.3.96.1020107131114.5706B-100000@longs.colorado.edu> eric at scipy.org (eric) fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-07 20:13:25 <0a3d01c1978e$4eb5c4d0$777ba8c0@ericlaptop>
<Pine.OSF.3.96.1020107130744.5706A-100000@longs.colorado.edu> prabhu at aero.iitm.ernet.in (Prabhu Ramachand... fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... 2002-01-07 20:09:53 <15417.51205.897078.969051@monster.linux.in>
<15418.20545.187001.297457@monster.linux.in> fperez at pizero.colorado.edu (=?ISO-8859-1?Q?... prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-08 01:49:53 <Pine.OSF.3.96.1020107130744.5706A-100000@long...
<Pine.LNX.4.21.0201101028100.14417-100000@cens.ioc.ee> eric at scipy.org (eric) pearu at cens.ioc.ee (Pearu Peterson) 2002-01-10 08:39:25 <01ab01c1995f$7021f120$6b01a8c0@ericlaptop>
<15421.33717.327232.805427@monster.linux.in> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-10 12:06:13 <01ab01c1995f$7021f120$6b01a8c0@ericlaptop>
<15421.48924.58063.901822@monster.linux.in> eric at scipy.org (eric) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 2002-01-10 16:19:40 <003001c199e4$4daf4590$6b01a8c0@ericlaptop>
<Pine.LNX.4.21.0201101919020.25482-100000@cens.ioc.ee> eric at scipy.org (eric) pearu at cens.ioc.ee (Pearu Peterson) 2002-01-10 17:22:43 <007701c199ef$7518fd50$6b01a8c0@ericlaptop>
... ... ... ...

14216 rows × 4 columns


In [9]:
exchanges.groupby(['From_original','From_response']).count()


Out[9]:
From_original From_response Date In-Reply-To
From_original From_response
00ai99@gmail.... (David Gowers (kampu)) gael.varoquaux@normalesup.... (Gael Varoquaux) 1 1 1 1
14366037@sun.ac... (Gouws, Almero <14366037@sun.ac.za>) ed@lamedomain.... (Ed Rahn) 1 1 1 1
matthieu.brucher@gmail.... (Matthieu Brucher) 1 1 1 1
14772361@sun.ac... (Hamman, RA, Mr <14772361@sun.ac.za>) d.l.goldsmith@gmail.... (David Goldsmith) 1 1 1 1
josh.holbrook@gmail.... (Joshua Holbrook) 1 1 1 1
Adam.Raczkowski@tufts.... (AJ Raczkowski) stefan@sun.ac... (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) 1 1 1 1
Alex.Perry at ge.com (Perry, Alexander (GE Infrastructure)) oliphant at ee.byu.edu (Travis Oliphant) 1 1 1 1
rkern at ucsd.edu (Robert Kern) 2 2 2 2
Andreas.Floeter@web... (Andreas =?iso-8859-1?q?Fl=F6ter?=) kamrik@gmail.... (Mark Koudritsky) 1 1 1 1
Balazs.Nemeth at shaw.ca (Balazs Nemeth) cookedm at physics.mcmaster.ca (David M. Cooke) 1 1 1 1
Bernhard.Hoefle at uibk.ac.at (Bernhard Reimar Hoefle) robert.kern at gmail.com (Robert Kern) 1 1 1 1
Brian.Newsom@Colorado.... (Brian Lee Newsom) charlesnwoods@gmail.... (Gmail) 1 1 1 1
charlesnwoods@gmail.... (Nathan Woods) 2 2 2 2
pav@iki... (Pauli Virtanen) 2 2 2 2
Chris.Barker at noaa.gov (Chris Barker) fperez at colorado.edu (Fernando Perez) 1 1 1 1
Chris.Fonnesbeck at MyFWC.com (Fonnesbeck, Chris) perry at stsci.edu (Perry Greenfield) 1 1 1 1
Chuck.Harris at sdl.usu.edu (Chuck Harris) clee at spiralis.merseine.nu (clee at spiralis.merseine.nu) 1 1 1 1
falted at openlc.org (Francesc Alted) 1 1 1 1
jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=) 1 1 1 1
jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=) 1 1 1 1
oliphant at ee.byu.edu (Travis Oliphant) 7 7 7 7
oliphant.travis at ieee.org (Travis Oliphant) 3 3 3 3
pearu at cens.ioc.ee (Pearu Peterson) 5 5 5 5
pearu at scipy.org (Pearu Peterson) 2 2 2 2
pearu at scipy.org (pearu at scipy.org) 8 8 8 8
D.J.Baker@soton.ac... (Baker D.J.) scipy@samueljohn... (Samuel John) 1 1 1 1
DavidA at ActiveState.com (David Ascher) joe at enthought.com (Joe Cooper) 1 1 1 1
pearu at cens.ioc.ee (Pearu Peterson) 1 1 1 1
perry at stsci.edu (Perry Greenfield) 1 1 1 1
Deil.Christoph@googlemail.... (Christoph Deil) charlesr.harris@gmail.... (Charles R Harris) 1 1 1 1
denis.laxalde@mcgill... (Denis Laxalde) 1 1 1 1
Doug.LATORNELL at mdsinc.com (LATORNELL, Doug) robert.kern at gmail.com (Robert Kern) 1 1 1 1
Dwf@cs.toronto.... (David Warde-Farley) david@ar.media.kyoto-u.ac... (David Cournapeau) 1 1 1 1
robert.kern@gmail.... (Robert Kern) 1 1 1 1
Eric.Buehler at smiths-aerospace.com (Buehler, Eric (AGRE)) oliphant.travis at ieee.org (Travis Oliphant) 1 1 1 1
Fernando.Perez at colorado.edu (Fernando Perez) Fernando.Perez at colorado.edu (Fernando Perez) 7 7 7 7
M.Reimpell at tu-bs.de (Michael Reimpell) 1 1 1 1
Norbert.Nemec.list at gmx.de (Norbert Nemec) 2 2 2 2
aisaac at american.edu (Alan G Isaac) 6 6 6 6
arnd.baecker at web.de (Arnd Baecker) 6 6 6 6
bgoli at sun.ac.za (Brett Olivier) 1 1 1 1
bulatov at cs.orst.edu (bulatov at cs.orst.edu) 1 1 1 1
chanley at stsci.edu (Christopher Hanley) 1 1 1 1
charles.harris at sdl.usu.edu (Charles Harris) 2 2 2 2
charles.harris at sdl.usu.edu (Charles R Harris) 1 1 1 1
charlesr.harris at gmail.com (Charles R Harris) 5 5 5 5
chris at fisher.forestry.uga.edu (Christopher Fonnesbeck) 1 1 1 1
chris at trichech.us (Christopher Fonnesbeck) 1 1 1 1
cimrman3 at ntc.zcu.cz (Robert Cimrman) 1 1 1 1
cookedm at physics.mcmaster.ca (David M. Cooke) 2 2 2 2
eric at enthought.com (eric jones) 10 10 10 10
eric at enthought.com (eric_imap) 1 1 1 1
faltet at carabos.com (Francesc Altet) 2 2 2 2
golux at comcast.net (Stephen C. Waterbury) 1 1 1 1
gruben at bigpond.net.au (Gary Ruben) 1 1 1 1
guyer at nist.gov (Jonathan Guyer) 4 4 4 4
joe at enthought.com (Joe Cooper) 6 6 6 6
jonathan.taylor at utoronto.ca (Jonathan Taylor) 2 2 2 2
kamrik at gmail.com (Mark Koudritsky) 2 2 2 2
nwagner at mecha.uni-stuttgart.de (Nils Wagner) 2 2 2 2
... ... ... ...

5538 rows × 4 columns

The next step is to create a DataFrame that for each pair A and B:

  • The duration of time between the first reply between that pair and the last.
  • The total number of replies from A to B, $r_{AB}$, and from B to A, $r_{BA}$.
  • The reciprocity of the conversation $min(r_{AB},r_{BA})/max(r_{AB},r_{BA})$

Using the exchanges data frame we can use the functions in twopeople.py to calculate the above quantities. The cell below gives a sample output for calculating duration, number of replies, and reciprocity for two specific contributers.


In [10]:
twoppl.duration(exchanges, "oliphant at ee.byu.edu (Travis Oliphant)", "rossini at blindglobe.net (A.J. Rossini)"  )
twoppl.panda_pair(exchanges, "oliphant at ee.byu.edu (Travis Oliphant)", "rossini at blindglobe.net (A.J. Rossini)"  )


Out[10]:
A B duration num_replies reciprocity
0 oliphant at ee.byu.edu (Travis Oliphant) rossini at blindglobe.net (A.J. Rossini) 465 days, 14:33:48 4 0.333333

1 rows × 5 columns

The "duration" column gives us a datetime data structure so we must be careful later on when extracting information from this column.


In [11]:
twoppl.duration(exchanges, "oliphant at ee.byu.edu (Travis Oliphant)", "rossini at blindglobe.net (A.J. Rossini)"  )


Out[11]:
datetime.timedelta(465, 52428)

Now let's create a dataframe that consists of these three quantities (duration, number of replies, and reciprocity) for each pair of contributers who interacted.

To get all the unique pairs we can use the unique_pairs function in twopeople.py. This willl give us all pairs between any two individuals who had some degree of interation.

Note: The unique pairs we get back do not include reversed pairs. For example, if one of the pairs was ("Bob", "Mary"), we would not have ("Mary", "Bob") in our output.


In [12]:
pairs = twoppl.unique_pairs(exchanges)

Using panda_allpairs, we can create the desired data frame


In [13]:
allpairs = twoppl.panda_allpairs(exchanges, pairs)
allpairs


Out[13]:
A B duration num_replies reciprocity
0 fperez.net@gmail.... (Fernando Perez) michael.abshoff@googlemail.... (Michael Abshoff) 00:00:00 1 0.000000
1 luethi@vaw.baug.ethz... (Martin =?ISO-8859-1?Q... rob.clewley@gmail.... (Rob Clewley) 00:00:00 1 0.000000
2 nils106@googlemail.... (Nils Wagner) ralf.gommers@gmail.... (Ralf Gommers) 543 days, 13:42:28 16 0.777778
3 lists@hilboll... (Andreas H.) lists@hilboll... (Andreas H.) 00:00:00 2 1.000000
4 charlesr.harris@gmail.... (Charles R Harris) aisaac@american.... (Alan G Isaac) 371 days, 23:57:10 2 1.000000
5 thomas.haslwanter@fh-linz... (Thomas Haslwanter) jjstickel@gmail.... (Jonathan Stickel) 00:00:00 1 0.000000
6 aric.hagberg@gmail.... (Aric Hagberg) niall.moran@gmail.... (Niall Moran) 00:00:00 1 0.000000
7 14772361@sun.ac... (Hamman, RA, Mr <14772361@s... josh.holbrook@gmail.... (Joshua Holbrook) 00:00:00 1 0.000000
8 mierle@gmail.... (Keir Mierle) stefan@sun.ac... (=?ISO-8859-1?Q?St=E9fan_van_... 00:00:00 1 0.000000
9 d_l_goldsmith@yahoo.... (David Goldsmith) jsseabold@gmail.... (Skipper Seabold) 00:00:00 1 0.000000
10 ericfode@gmail.... (Fode) ralf.gommers@googlemail.... (Ralf Gommers) 00:00:00 1 0.000000
11 eric at scipy.org (eric) pearu at scipy.org (pearu at scipy.org) 37 days, 00:24:16 18 0.000000
12 david.kremer.dk@gmail.... (David Kremer) jason-sage@creativetrax.... (Jason Grout) 01:09:23 2 1.000000
13 pwang@enthought.... (Peter Wang) pwang@enthought.... (Peter Wang) 264 days, 03:27:15 6 1.000000
14 ndbecker2@gmail.... (Neal Becker) jsseabold@gmail.... (Skipper Seabold) 00:00:00 1 0.000000
15 arkapravobhaumik@gmail.... (Arkapravo Bhaumik) david@ar.media.kyoto-u.ac... (David Cournapeau) 00:00:00 1 0.000000
16 jba@SDF.LONESTAR.... (Jeffrey Armstrong) josef.pktd@gmai... (josef.pktd@gmai...) 00:00:00 1 0.000000
17 collinstocks@gmail.... (Collin Stocks) ralf.gommers@googlemail.... (Ralf Gommers) 36 days, 05:07:48 3 0.500000
18 pav+sp@iki... (Pauli Virtanen) charlesr.harris@gmail.... (Charles R Harris) 00:49:01 2 0.000000
19 josef.pktd@gmai... (josef.pktd@gmai...) denis@laxalde.... (Denis Laxalde) 27 days, 12:09:07 4 1.000000
20 ljosa@broad.mit.... (Vebjorn Ljosa) david@ar.media.kyoto-u.ac... (David Cournapeau) 00:00:00 1 0.000000
21 pav@iki... (Pauli Virtanen) scopatz@gmail.... (Anthony Scopatz) 27 days, 15:21:06 3 0.000000
22 oliphant.travis at ieee.org (Travis Oliphant) kwgoodman at gmail.com (Keith Goodman) 00:16:02 3 0.500000
23 josef.pktd@gmai... (josef.pktd@gmai...) jtravs@gmail.... (John Travers) 00:55:41 2 1.000000
24 moritz.beber@googlemail.... (Moritz Emanuel Be... josef.pktd@gmai... (josef.pktd@gmai...) 00:00:00 1 0.000000
25 nkilleen@unimelb.edu... (Neil Killeen) dwf@cs.toronto.... (David Warde-Farley) 2 days, 00:55:53 5 0.666667
26 nwagner@iam.uni-stuttgart... (Nils Wagner) jba@sdf.lonestar.... (Jeffrey Armstrong) 10 days, 23:07:37 2 1.000000
27 opossumnano@gmail.... (Tiziano Zito) nwagner@iam.uni-stuttgart... (Nils Wagner) 11 days, 09:32:57 3 0.000000
28 aarchiba@physics.mcgill... (Anne Archibald) david@silveregg.co... (David) 00:00:00 1 0.000000
29 eric at scipy.org (eric) oliphant at ee.byu.edu (Travis Oliphant) 13 days, 03:28:33 3 0.000000
30 matthew.brett@gmail.... (Matthew Brett) matthew.brett@gmail.... (Matthew Brett) 2572 days, 18:07:19 52 1.000000
31 warren.weckesser@enthought.... (Warren Weckesser) fboulogne@sciunto.... (=?ISO-8859-1?Q?Fran=E7o... 00:00:00 1 0.000000
32 fbreitling@aip... (Frank Breitling) sturla@molden... (Sturla Molden) 53 days, 19:26:00 14 1.000000
33 cookedm at physics.mcmaster.ca (David M. Cooke) cookedm at physics.mcmaster.ca (David M. Cooke) 290 days, 01:54:53 6 1.000000
34 eads@soe.ucsc.... (Damian Eads) david@ar.media.kyoto-u.ac... (David Cournapeau) 352 days, 19:11:58 7 0.750000
35 byrnes at bu.edu (John Byrnes) byrnes at bu.edu (John Byrnes) 00:00:00 2 1.000000
36 rudolphv at gmail.com (Rudolph van der Merwe) robert.kern at gmail.com (Robert Kern) 00:44:30 4 1.000000
37 matthew.brett@gmail.... (Matthew Brett) faltet@carabos.... (Francesc Altet) 00:00:00 1 0.000000
38 jh at oobleck.astro.cornell.edu (Joe Harrington) rkern at ucsd.edu (Robert Kern) 372 days, 12:21:38 3 0.000000
39 josef.pktd@gmai... (josef.pktd@gmai...) bsouthey@gmail.... (Bruce Southey) 1138 days, 10:18:52 64 0.939394
40 jsseabold@gmail.... (Skipper Seabold) lists@hilboll... (Andreas Hilboll) 00:00:00 1 0.000000
41 mmetz at astro.uni-bonn.de (Manuel Metz) prabhu at aero.iitm.ernet.in (Prabhu Ramachand... 00:00:00 1 0.000000
42 warren.weckesser@enthought.... (Warren Weckesser) dwf@cs.toronto.... (David Warde-Farley) 00:00:00 1 0.000000
43 gokhansever@gmail.... (=?UTF-8?Q?G=C3=B6khan_S... cgohlke@uci.... (Christoph Gohlke) 21:30:21 4 1.000000
44 eric at scipy.org (eric) jhauser at ifm.uni-kiel.de (Janko Hauser) 00:00:00 1 0.000000
45 scopatz@gmail.... (Anthony Scopatz) bsouthey@gmail.... (Bruce Southey) 184 days, 18:21:52 4 0.333333
46 nwagner at mecha.uni-stuttgart.de (Nils Wagner) cimrman3 at ntc.zcu.cz (Robert Cimrman) 81 days, 02:12:43 13 0.857143
47 guyer at nist.gov (Jonathan Guyer) schofield at ftw.at (Ed Schofield) 77 days, 06:06:13 6 1.000000
48 mattknox_ca@hotmail.... (Matt Knox) millman@berkeley.... (Jarrod Millman) 00:00:00 1 0.000000
49 dagss@student.matnat.uio... (Dag Sverre Seljeb... josef.pktd@gmai... (josef.pktd@gmai...) 200 days, 01:28:55 4 0.333333
50 eric at scipy.org (eric) arnd.baecker at physik.uni-ulm.de (arnd.baecke... 00:00:00 1 0.000000
51 robert.kern@gmail.... (Robert Kern) perry@stsci.... (Perry Greenfield) 193 days, 01:20:35 7 0.750000
52 rkern at ucsd.edu (Robert Kern) oliphant at ee.byu.edu (Travis Oliphant) 478 days, 10:47:11 55 0.896552
53 mcp.stras@gmail.... (Martin Campos Pinto) pav@iki... (Pauli Virtanen) 9 days, 19:07:23 5 0.666667
54 ondrej@certik... (Ondrej Certik) fredrik.johansson@gmail.... (Fredrik Johansson) 00:00:00 1 0.000000
55 pav@iki... (Pauli Virtanen) prabhu@aero.iitb.ac... (Prabhu Ramachandran) 00:39:00 2 0.000000
56 nwagner@iam.uni-stuttgart... (Nils Wagner) aric.hagberg@gmail.... (Aric Hagberg) 00:00:00 1 0.000000
57 juanlu001@gmail.... (Juan Luis Cano) juanlu001@gmail.... (Juan Luis Cano) 37 days, 15:42:24 8 1.000000
58 jonas at mwl.mit.edu (Eric Jonas) jonathan.taylor at stanford.edu (Jonathan Taylor) 00:00:00 1 0.000000
59 robert.kern@gmail.... (Robert Kern) kwgoodman@gmail.... (Keith Goodman) 102 days, 06:54:22 2 0.000000
... ... ... ... ...

3992 rows × 5 columns

The printed values below display the counts associated for a given number of replies.

We can see that many of the interactions have a relatively small amount of replies. One possible explanation for this large amount may be having many interactions that are merely a follow-up or question followed by a thank you response (we can explore this further by parsing the message bodies in arx.data but for now we will just speculate).


In [14]:
print "corresponding counts for number of replies up to 19"
print ("number of replies", "frequency of occurence")
for i in range(20):
    print (i, len(allpairs[allpairs['num_replies'] <= i]) - len(allpairs[allpairs['num_replies'] <= i - 1]))


corresponding counts for number of replies up to 19
('number of replies', 'frequency of occurence')
(0, 0)
(1, 1895)
(2, 805)
(3, 328)
(4, 257)
(5, 120)
(6, 101)
(7, 68)
(8, 67)
(9, 41)
(10, 46)
(11, 23)
(12, 27)
(13, 14)
(14, 16)
(15, 12)
(16, 20)
(17, 8)
(18, 13)
(19, 9)

In [15]:
plt.hist(allpairs['num_replies'])
plt.title("Number of replies")


Out[15]:
<matplotlib.text.Text at 0x107b01450>

To get a better idea of what's going on for larger values, let's look at the rows that have number of replies greater than 5.


In [16]:
greaterThanFive = allpairs[allpairs['num_replies'] > 5]['num_replies']
counts = greaterThanFive.value_counts()
counts.plot()


Out[16]:
<matplotlib.axes.AxesSubplot at 0x10c94d810>

The graph seems to follow a power law which is expected for this type of data.

Now let's see if we can find any patters between the number of replies and reciprocity. Intuitively, we would expect that the number of replies be positively associated with reciprocity but let's see...

We will first look at the data for which the number of replies is greater than 5 to possibly get rid of some noise (later on we will explore the data without removing these entries).

Below we divided reciprocity and number of replies into completely arbitrary bins as shown below. Hopefully, this will make it easier to see patters between these two variables as they have quite a bit of noise.

(The genId and genNumReplies functions just give each entry a corresponding label for graphing purposes later on. These labels are based on which bin a given entry falls under)


In [17]:
#Completely arbitrary bins

#Group A reciprocity between (0, .25]
#Group B reciprocity between (.25, .5] 
#Group C reciprocity between (.5, .75]  
#Group D reciprocity between (.75, 1.00]

#"low" number of replies less than or equal to 10
#"moderate" number of replies between 10 and 20
#"high" replies greater than 20 replies 

def genId(num):
    if num <= .25:
        return 'A.(0, .25]'
    if num <= .5:
        return "B.(.25, .5]"
    if num <= .75:
        return "C.(.5, .75]"
    return "D.(.75, 1.00]"

def genNumReplies(num):
    if num <= 10:
        return 'a.low'
    if num <= 20:
        return "b.moderate"
    return "c.high"

The following lines generate a data frame that contains three columns: 1) Number of replies 2) Id corresponding to replies bin 3) Id corresponding to reciprocity bin

(The extra letters such as the a in "a.low" are just used so that pandas orders the columns in the way we want)


In [18]:
moreThanFive = allpairs[allpairs['num_replies'] > 5]
recipVec = moreThanFive['reciprocity']
numReplies = moreThanFive['num_replies']
ids = recipVec.apply(lambda val: genId(val))
groupedRep = numReplies.apply(lambda val: genNumReplies(val))
groupsdf = pd.DataFrame({"num_replies": numReplies, "ids": ids, "grouped_num_replies": groupedRep})
groupsdf


Out[18]:
grouped_num_replies ids num_replies
2 b.moderate D.(.75, 1.00] 16
11 b.moderate A.(0, .25] 18
13 a.low D.(.75, 1.00] 6
30 c.high D.(.75, 1.00] 52
32 b.moderate D.(.75, 1.00] 14
33 a.low D.(.75, 1.00] 6
34 a.low C.(.5, .75] 7
39 c.high D.(.75, 1.00] 64
46 b.moderate D.(.75, 1.00] 13
47 a.low D.(.75, 1.00] 6
51 a.low C.(.5, .75] 7
52 c.high D.(.75, 1.00] 55
57 a.low D.(.75, 1.00] 8
65 a.low B.(.25, .5] 6
86 b.moderate D.(.75, 1.00] 18
96 a.low B.(.25, .5] 6
102 a.low B.(.25, .5] 6
105 b.moderate D.(.75, 1.00] 13
109 a.low D.(.75, 1.00] 6
116 c.high C.(.5, .75] 58
121 a.low D.(.75, 1.00] 9
125 a.low C.(.5, .75] 7
127 a.low B.(.25, .5] 7
136 b.moderate D.(.75, 1.00] 16
139 a.low B.(.25, .5] 6
148 c.high D.(.75, 1.00] 22
155 b.moderate C.(.5, .75] 18
156 c.high D.(.75, 1.00] 22
170 b.moderate C.(.5, .75] 12
172 a.low D.(.75, 1.00] 6
181 a.low B.(.25, .5] 8
193 b.moderate C.(.5, .75] 16
206 b.moderate D.(.75, 1.00] 12
207 a.low A.(0, .25] 6
214 a.low A.(0, .25] 8
215 a.low D.(.75, 1.00] 9
220 a.low C.(.5, .75] 7
223 a.low D.(.75, 1.00] 6
224 a.low B.(.25, .5] 7
231 a.low D.(.75, 1.00] 10
233 a.low B.(.25, .5] 6
236 a.low D.(.75, 1.00] 8
237 c.high C.(.5, .75] 39
242 b.moderate D.(.75, 1.00] 12
245 a.low C.(.5, .75] 7
252 b.moderate C.(.5, .75] 12
256 b.moderate D.(.75, 1.00] 12
264 a.low C.(.5, .75] 7
265 b.moderate D.(.75, 1.00] 11
272 a.low C.(.5, .75] 10
284 b.moderate D.(.75, 1.00] 16
288 a.low D.(.75, 1.00] 6
296 b.moderate C.(.5, .75] 12
297 c.high D.(.75, 1.00] 80
306 c.high D.(.75, 1.00] 28
324 b.moderate C.(.5, .75] 12
333 c.high C.(.5, .75] 39
337 a.low D.(.75, 1.00] 10
344 a.low D.(.75, 1.00] 8
356 a.low C.(.5, .75] 7
... ... ...

587 rows × 3 columns

Now that we have this data frame lets look at the corresponding histograms for each "level" of reciprocity.


In [19]:
grpA = groupsdf[groupsdf["ids"] == "A.(0, .25]"]['num_replies']
grpB = groupsdf[groupsdf["ids"] == "B.(.25, .5]"]['num_replies']
grpC = groupsdf[groupsdf["ids"] == "C.(.5, .75]"]['num_replies']
grpD = groupsdf[groupsdf["ids"] == "D.(.75, 1.00]"]['num_replies']
grpA.value_counts().hist()
plt.title("Number of Replies for Reciprocity between 0-.25")


Out[19]:
<matplotlib.text.Text at 0x1084c3950>

In [20]:
grpB.value_counts().hist()
plt.title("Number of Replies for Reciprocity between .25-.5")


Out[20]:
<matplotlib.text.Text at 0x1090d8890>

In [21]:
grpC.value_counts().hist()
plt.title("Number of Replies for Reciprocity between .5-.75")


Out[21]:
<matplotlib.text.Text at 0x107c65890>

In [22]:
grpD.value_counts().hist()
plt.title("Number of Replies for Reciprocity between .75-1.0")


Out[22]:
<matplotlib.text.Text at 0x1092a6d10>

It's pretty hard to compare the four histograms so let's create a contingency table for the groupsdf data frame.


In [23]:
crossed = pd.crosstab(groupsdf["grouped_num_replies"], groupsdf["ids"])
crossed


Out[23]:
ids A.(0, .25] B.(.25, .5] C.(.5, .75] D.(.75, 1.00]
grouped_num_replies
a.low 27 77 84 135
b.moderate 13 13 45 78
c.high 7 6 26 76

3 rows × 4 columns


In [24]:
crossed.plot()


Out[24]:
<matplotlib.axes.AxesSubplot at 0x109681290>

Since each reciprocity group has a different amount of counts, let's normalize the counts to get a better picture of what's going on.

We will first normzalize column-wise, that is for say column A.[0,.25] we will sum the total number of responses and get the relative proportions for the replies bins.


In [25]:
crossed.apply(lambda r: r/sum(r), axis=0)


Out[25]:
ids A.(0, .25] B.(.25, .5] C.(.5, .75] D.(.75, 1.00]
grouped_num_replies
a.low 0.574468 0.802083 0.541935 0.467128
b.moderate 0.276596 0.135417 0.290323 0.269896
c.high 0.148936 0.062500 0.167742 0.262976

3 rows × 4 columns

We see that at the very extremes, namely reciprocity between 0-.25 and reciprocity between .75-1.0, there are some clear differences; reciprocity seems to be positively associated with the number of replies as we had initially expected.

On the other hand, the bin for reciprocity between .25-.5 weakens this association as this bin seems as if it should swap positions with bin A. However, since the bin widths we chose were completely arbitrary it may explain this paradox.

The fact that the extremes seem to follow our expectations is quite interesting; it provides some evidence that if we choose our bin sizes appropriately, we can perhaps get a nice positive association.


In [26]:
crossed.apply(lambda r: r/sum(r), axis=0).plot()
plt.title("normalized (columnwise) plot")


Out[26]:
<matplotlib.text.Text at 0x109b85ad0>

Now will do the normalization by row. This will give us the relative proportion of some bin for number of replies is distributed across the bins for reciprocity.


In [27]:
crossed.apply(lambda r: r/sum(r), axis=1)


Out[27]:
ids A.(0, .25] B.(.25, .5] C.(.5, .75] D.(.75, 1.00]
grouped_num_replies
a.low 0.083591 0.238390 0.260062 0.417957
b.moderate 0.087248 0.087248 0.302013 0.523490
c.high 0.060870 0.052174 0.226087 0.660870

3 rows × 4 columns


In [28]:
crossed.apply(lambda r: r/sum(r), axis=1).plot()
plt.title("normalized (row-wise) plot")


Out[28]:
<matplotlib.text.Text at 0x109ee9810>

Now let's go back and do the exact same thing but not removing entries with a very low amount of replies.


In [29]:
recipVec2 = allpairs['reciprocity']
numReplies2 = allpairs['num_replies']
ids = recipVec2.apply(lambda val: genId(val))
groupedRep2 = numReplies2.apply(lambda val: genNumReplies(val))
groupsdf2 = pd.DataFrame({"num_replies": numReplies2, "ids": ids, "grouped_num_replies": groupedRep2})

In [30]:
crossed2 = pd.crosstab(groupsdf2["grouped_num_replies"], groupsdf2["ids"])
crossed2


Out[30]:
ids A.(0, .25] B.(.25, .5] C.(.5, .75] D.(.75, 1.00]
grouped_num_replies
a.low 2253 422 170 883
b.moderate 13 13 45 78
c.high 7 6 26 76

3 rows × 4 columns


In [31]:
crossed2.plot()


Out[31]:
<matplotlib.axes.AxesSubplot at 0x109f9a990>

In [32]:
crossed2.apply(lambda r: r/sum(r), axis=0)


Out[32]:
ids A.(0, .25] B.(.25, .5] C.(.5, .75] D.(.75, 1.00]
grouped_num_replies
a.low 0.991201 0.956916 0.705394 0.851495
b.moderate 0.005719 0.029478 0.186722 0.075217
c.high 0.003080 0.013605 0.107884 0.073288

3 rows × 4 columns


In [33]:
crossed2.apply(lambda r: r/sum(r), axis=0).plot()


Out[33]:
<matplotlib.axes.AxesSubplot at 0x10a038cd0>

Now we will look at various scatterplots for different variables to get a rough sense of how our data is spread.


In [34]:
plt.scatter(allpairs.num_replies, allpairs.reciprocity)
plt.title("number of replies vs. reciprocity")


Out[34]:
<matplotlib.text.Text at 0x10a35f950>

In [35]:
allpairs['duration'] = allpairs['duration'].apply(lambda x: x.item() / pow(10,9))
plt.scatter(allpairs.duration, allpairs.num_replies)
plt.title("duration vs. number of replies")


Out[35]:
<matplotlib.text.Text at 0x10a3f1a90>

Now let's look at some scatterplots for the entries with number of replies greater than 5.


In [36]:
df_filt = allpairs[allpairs['num_replies'] > 5]
plt.scatter(df_filt.reciprocity, df_filt.duration)
plt.title("reciprocity vs. duration")
from scipy.stats.stats import pearsonr
pearsonr(df_filt.reciprocity, df_filt.duration)


Out[36]:
(0.001299634253067017, 0.97493415924233306)

In [37]:
plt.scatter(df_filt.reciprocity, df_filt.duration.apply(lambda x: math.log(x)))
plt.title("reciprocity vs. log of duration")


Out[37]:
<matplotlib.text.Text at 0x10af8d690>

In [38]:
plt.hist(np.log1p(df_filt.duration.values))


Out[38]:
(array([   1.,    2.,    4.,   10.,   24.,   29.,   77.,   93.,  209.,  138.]),
 array([  6.35437004,   7.64088059,   8.92739115,  10.2139017 ,
         11.50041226,  12.78692281,  14.07343336,  15.35994392,
         16.64645447,  17.93296503,  19.21947558]),
 <a list of 10 Patch objects>)

In [39]:
plt.scatter(df_filt.duration.apply(lambda x: math.log(x+1)), df_filt.num_replies.apply(lambda x: math.log(x+1)))
plt.title("log of duration vs. log of number of replies")


Out[39]:
<matplotlib.text.Text at 0x10b0d73d0>

In [73]:
import seaborn as sns

sns.jointplot(df_filt.reciprocity, np.log1p(df_filt.duration),kind="hex")


Out[73]:
<seaborn.axisgrid.JointGrid at 0x1103eca90>

Since we actually have the bodies of each message, we will now procceed by seeing if there are any patterns between the type of messages sent and reciprocity, duration, and the number of replies.

As a very rough measure, we have created a function calMessageLen that calculates the length of a given message.


In [41]:
def calMessageLen(message):
    if message == None:
        return 0
    return len(message)

arx.data['length'] = arx.data['Body'].apply(lambda x: calMessageLen(x))
arx.data


Out[41]:
From Subject Date In-Reply-To References Body length
Message-ID
<NEBBIECAMLMAAKHEGPCGKEBHCLAA.travis@vaught.net> travis at vaught.net (Travis N. Vaught) [SciPy-dev] SciPy Developer mailing list now o... 2001-06-11 02:10:51 None None The link:\n\nhttp://scipy.net/mailman/listinfo... 116
<Pine.LNX.4.33.0107231957590.15960-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] RPMs and source distribution 2001-07-24 02:01:00 <02f001c111bf$2e78a9d0$777ba8c0@190xb01> None I've been playing for hours and finally have i... 203
<87snfmn8o1.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-07-24 06:27:42 <Pine.LNX.4.33.0107231957590.15960-100000@olip... <Pine.LNX.4.33.0107231957590.15960-100000@olip... >>>>> "TO" == Travis Oliphant <oliphant at ee.... 800
<Pine.LNX.4.33.0107240041500.16169-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] RPMs and source distribution 2001-07-24 06:46:36 <87snfmn8o1.fsf@jeeves.blindglobe.net> None > >>>>> "TO" == Travis Oliphant <oliphant at e... 753
<87itgisti1.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-07-24 06:58:46 <Pine.LNX.4.33.0107240041500.16169-100000@olip... <Pine.LNX.4.33.0107240041500.16169-100000@olip... >>>>> "TO" == Travis Oliphant <oliphant at ee.... 700
<20010725005337.PHSJ11181.tomts14-srv.bellnexxia.net@localhost> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Building SciPy on Mac OS X 2001-07-25 00:53:00 None None All,\n\nI've decide that I'm going to tackle g... 882
<20010725034411.SATM11181.tomts14-srv.bellnexxia.net@localhost> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Building SciPy on Mac OS X Part II 2001-07-25 03:43:46 None None All,\n\nAs I mentioned in my previous message,... 518
<20010801231139.YWP4521.tomts13-srv.bellnexxia.net@localhost> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Splines in SciPy 2001-08-01 23:09:41 None None All,\n\nGetting the new version of SciPy from ... 796
<024e01c11b4e$f5510050$7600a8c0@ericlaptop> eric at scipy.org (eric jones) Fw: [SciPy-dev] Splines in SciPy 2001-08-02 12:30:42 None None Hey Tim,\n\n I just noticed this addition too.... 2031
<Pine.LNX.4.33.0108021551360.838-100000@sugar.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] Spline module 2001-08-02 21:55:24 <200108021700.f72H03D17694@scipy.org> None >\n> Getting the new version of SciPy from CVS... 2023
<00e601c11bc5$0dbea630$c300a8c0@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] Spline module 2001-08-03 02:36:09 None <Pine.LNX.4.33.0108021551360.838-100000@sugar.... I'm no splines expert, but I'd hope there was ... 3106
<200108031856.OAA29753@mud.cgl.uwaterloo.ca> tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey) [SciPy-dev] Re: Splines module 2001-08-03 18:56:28 <200108031700.f73H01D31118@scipy.org> from "sc... None > \n> Date: Thu, 2 Aug 2001 15:55:24 -0600 (MD... 2380
<005101c12a92$ce8e2b20$ec88fea9@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] RPMs and source distribution 2001-08-21 22:44:15 None <Pine.LNX.4.33.0107231957590.15960-100000@olip... Hey Joe,\n\nDebian packages would be good. Yo... 2361
<87u1z1j90m.fsf@phantom.ecn.uiowa.edu> jmr at engineering.uiowa.edu (Joe Reinhardt) [SciPy-dev] RPMs and source distribution 2001-08-21 23:19:53 <87snfmn8o1.fsf@jeeves.blindglobe.net>\n (ross... <Pine.LNX.4.33.0107231957590.15960-100000@olip... Did this debian package ever materialize? \n... 991
<009f01c12aaa$3bee89a0$ec88fea9@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] AIX isinf, isfinite, etc. 2001-08-22 01:31:57 None None The good news:\n\nAlmost all of scipy now comp... 667
<NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought.com> travis at scipy.org (Travis N. Vaught) [SciPy-dev] RPMs and source distribution 2001-08-22 03:44:22 <87u1z1j90m.fsf@phantom.ecn.uiowa.edu> None We haven't heard anything yet about a working ... 1635
<87u1z01ef6.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] RPMs and source distribution 2001-08-22 12:12:45 <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... >>>>> "TNV" == Travis N Vaught <travis at scip... 885
<lg13d6klyt3.fsf@pet.ecn.uiowa.edu> jmr at engineering.uiowa.edu (Joe Reinhardt) [SciPy-dev] RPMs and source distribution 2001-08-22 18:44:24 <87u1z01ef6.fsf@jeeves.blindglobe.net>\n (ross... <NEBBIECAMLMAAKHEGPCGCEKICMAA.travis@enthought... rossini at blindglobe.net (A.J. Rossini) write... 875
<20010902195926.A20656@myrddin.caltech.edu> kern at caltech.edu (Robert Kern) [SciPy-dev] LAPACK is not thread-safe (AFAICT) 2001-09-03 02:59:26 None None Disclaimer: I'm not a FORTRAN programmer or a ... 1213
<3B93B249.A3E8B37D@home.com> europax at home.com (Rob) [SciPy-dev] documentation 2001-09-03 16:39:38 None None Is there a non-html version of the documentati... 388
<00c101c1370e$f6758010$c100a8c0@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] scipy.linalg and LAPACK 3.0 2001-09-06 20:03:13 None <20010906133610.A18337@myrddin.caltech.edu> I'd say the latter of the two. I started lina... 1743
<20010906133610.A18337@myrddin.caltech.edu> kern at caltech.edu (Robert Kern) [SciPy-dev] scipy.linalg and LAPACK 3.0 2001-09-06 20:36:10 None None Hi,\n\nSome of the routines wrapped in generic... 508
<86d74cr9q2.fsf@bock.chem.unc.edu> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... [SciPy-dev] SciPy on Cygwin 2001-09-27 14:26:13 None None I am trying to compile SciPy (cvs) on latest C... 3216
<Pine.LNX.4.33.0109271117580.25675-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 17:19:26 <200109271700.f8RH02U04182@scipy.org> None > 1. SciPy on Cygwin (Jochen =?iso-8859-1?q... 1431
<Pine.LNX.4.21.0109271959570.22161-100000@cens.ioc.ee> pearu at cens.ioc.ee (Pearu Peterson) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 18:10:57 <Pine.LNX.4.33.0109271117580.25675-100000@olip... None On Thu, 27 Sep 2001, Travis Oliphant wrote:\n\... 1367
<Pine.LNX.4.21.0109272015040.22161-100000@cens.ioc.ee> pearu at cens.ioc.ee (Pearu Peterson) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 18:16:32 <Pine.LNX.4.21.0109271959570.22161-100000@cens... None On Thu, 27 Sep 2001, Pearu Peterson wrote:\n\n... 196
<20010927124852.A18567@myrddin.caltech.edu> kern at caltech.edu (Robert Kern) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #11 - ... 2001-09-27 19:48:52 <Pine.LNX.4.21.0109271959570.22161-100000@cens... <Pine.LNX.4.33.0109271117580.25675-100000@olip... On Thu, Sep 27, 2001 at 08:10:57PM +0200, Pear... 1637
<20010927164848.Q1356@dothill.com> jason at tishler.net (Jason Tishler) [SciPy-dev] Re: SciPy on Cygwin 2001-09-27 20:48:48 <86d74cr9q2.fsf@bock.chem.unc.edu> None Jochen,\n\nOn Thu, Sep 27, 2001 at 10:26:13AM ... 777
<86g097pgqt.fsf@bock.chem.unc.edu> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] SciPy on Cygwin 2001-09-28 14:48:13 <86d74cr9q2.fsf@bock.chem.unc.edu> <86d74cr9q2.fsf@bock.chem.unc.edu> A little further, but now I am having linking ... 4848
<20010928135700.X1356@dothill.com> jason at tishler.net (Jason Tishler) [SciPy-dev] SciPy on Cygwin 2001-09-28 17:57:00 <86g097pgqt.fsf@bock.chem.unc.edu> None Jochen,\n\nOn Fri, Sep 28, 2001 at 10:48:13AM ... 5089
<Pine.LNX.4.21.0109281946030.31117-100000@cens.ioc.ee> pearu at cens.ioc.ee (Pearu Peterson) [SciPy-dev] SciPy on Cygwin 2001-09-28 18:03:10 <86g097pgqt.fsf@bock.chem.unc.edu> None On 28 Sep 2001, Jochen K?pper wrote:\n\n> Runn... 802
<20010930142118.16379.qmail@lisboa.ifm.uni-kiel.de> jhauser at ifm.uni-kiel.de (Janko Hauser) [SciPy-dev] Documentation and then some 2001-09-30 14:21:18 None None Hi, I saw the notice for the new documentation... 1643
<5.1.0.14.2.20010930132153.00a7beb8@pop.fas.harvard.edu> wsryu at fas.harvard.edu (William Ryu) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #14 - ... 2001-09-30 17:25:03 <200109301701.f8UH11U00314@scipy.org> None At 12:01 PM 9/30/2001 -0500, you wrote:\n>From... 1022
<20010930214531.16525.qmail@lisboa.ifm.uni-kiel.de> jhauser at ifm.uni-kiel.de (Janko Hauser) [SciPy-dev] Re: Scipy-dev digest, Vol 1 #14 - ... 2001-09-30 21:45:31 <5.1.0.14.2.20010930132153.00a7beb8@pop.fas.ha... <200109301701.f8UH11U00314@scipy.org> <5.1.0.1... Sorry for the last mail if it actually came th... 197
<wkofnegqee.fsf@lee-morgan.net> python at lee-morgan.net (python at lee-morgan... [SciPy-dev] scipy plt can only plot once? 2001-10-11 15:14:33 None None After investigation its due to a division erro... 1169
<KPEIKEPEIBGNGNKHDILIEEEPCEAA.travis@enthought.com> travis at scipy.org (Travis N. Vaught) [SciPy-dev] scipy plt can only plot once? 2001-10-11 15:22:01 <wkofnegqee.fsf@lee-morgan.net> None plt.figure() should create another wxFrame ins... 1786
<DGEHJAIMAIENCEHBIHCJAEGACIAA.altis@semi-retired.com> altis at semi-retired.com (Kevin Altis) [SciPy-dev] scipy plt can only plot once? 2001-10-11 17:29:01 <KPEIKEPEIBGNGNKHDILIEEEPCEAA.travis@enthought... None I'm new to scipy, but I'm trying out the tutor... 3671
<wk669lmllp.fsf@lee-morgan.net> python at lee-morgan.net (python at lee-morgan... [SciPy-dev] scipy plt can only plot once? 2001-10-12 06:13:54 None None > \n>Travis N. Vaught\n>\n>plt.figure() shoul... 269
<01eb01c15349$14069be0$c100a8c0@ericlaptop> ej at ee.duke.edu (eric jones) [SciPy-dev] scipy plt can only plot once? 2001-10-12 18:09:46 None <DGEHJAIMAIENCEHBIHCJAEGACIAA.altis@semi-retir... ----- Original Message -----\nFrom: "Kevin Alt... 1029
<DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retired.com> altis at semi-retired.com (Kevin Altis) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-15 17:39:46 None None I asked a number of friends involved in scienc... 1162
<86r8rzcy55.fsf@bock.chem.unc.edu> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... [SciPy-dev] cvs problems? 2001-10-19 17:55:50 None None -----BEGIN PGP SIGNED MESSAGE-----\nHash: SHA1... 996
<m34rottvkf.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] plt 2001-10-21 05:24:00 None None -----BEGIN PGP SIGNED MESSAGE-----\nHash: SHA1... 1104
<Pine.LNX.4.10.10110211647340.14279-100000@pc103.maths.bris.ac.uk> a.backer at bristol.ac.uk (Arnd Baecker) [SciPy-dev] Link problem with scipy on SUN ? 2001-10-21 20:49:42 None None Hi,\n\nwhen trying to install scipy I ran into... 2009
<033101c15bd7$82d0e8f0$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) [SciPy-dev] plt 2001-10-23 15:29:26 None <m34rottvkf.fsf@box.home.de> Hey Jochen,\n\n> Is there a good reason not to... 2026
<033501c15bd7$87062e30$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) [SciPy-dev] cvs problems? 2001-10-23 15:29:38 None <86r8rzcy55.fsf@bock.chem.unc.edu> Sorry bout that. Some files were added and gr... 1603
<033e01c15bdb$b5c38660$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) [SciPy-dev] Link problem with scipy on SUN ? 2001-10-23 15:59:34 None <Pine.LNX.4.10.10110211647340.14279-100000@pc1... Hey Arnd,\n\nShort answer:\n\nSun has only bee... 3874
<03a801c15be9$a77e3ba0$777ba8c0@ericlaptop> ejones17 at austin.rr.com (ERIC JONES) Fw: [SciPy-dev] Link problem with scipy on SUN ? 2001-10-23 17:39:15 None <Pine.LNX.4.10.10110231837420.17942-100000@pc1... Hey,\n\nSorry that was a typo.\n\nI meant\n\n ... 8889
<m37ktl3kfu.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] cvs problems? 2001-10-24 01:17:57 <033501c15bd7$87062e30$777ba8c0@ericlaptop> <86r8rzcy55.fsf@bock.chem.unc.edu>\n\t<033501c... -----BEGIN PGP SIGNED MESSAGE-----\nHash: SHA1... 854
<m3k7xl1udk.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] wxplt 2001-10-24 05:26:15 None None Patch against current cvs, cleaning up wildcar... 1976
<86zo6h2ilo.fsf@bock.chem.unc.edu> jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCp... [SciPy-dev] PlotCanvas 2001-10-24 15:13:50 None None A non-text attachment was scrubbed...\nName: p... 206
<000b01c15cd1$8755c530$6600a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] PlotCanvas 2001-10-24 21:19:03 None <86zo6h2ilo.fsf@bock.chem.unc.edu> Hey Jochen,\n\nThe figure() method is actually... 3010
<m3pu7c2z8g.fsf@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 03:08:15 <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... Kevin, All,\n\njust stumbled over this. Maybe ... 3639
<m3elns2vkw.fsf_-_@box.home.de> jochen at jochen-kuepper.de (Jochen =?iso-8859... [SciPy-dev] Re: PlotCanvas 2001-10-25 04:27:11 <000b01c15cd1$8755c530$6600a8c0@ericlaptop> <86zo6h2ilo.fsf@bock.chem.unc.edu>\n\t<000b01c... On Wed, 24 Oct 2001 17:19:03 -0400 eric jones ... 2906
<004701c15d55$d889d700$c400a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] reading ascii files into arrays 2001-10-25 13:06:22 None <18646.1003998017@www7.gmx.net> > Hi Eric !\n>\n> I have decided to use python... 2512
<87vgh3lvh8.fsf@jeeves.blindglobe.net> rossini at blindglobe.net (A.J. Rossini) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 13:06:43 <m3pu7c2z8g.fsf@box.home.de> <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... >>>>> "Jochen" == Jochen K?pper <jochen at joc... 1245
<005e01c15d58$d09845b0$c400a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] Re: PlotCanvas 2001-10-25 13:27:37 None <86zo6h2ilo.fsf@bock.chem.unc.edu><000b01c15cd... > Not exactly. What I really need is to tell p... 1656
<007501c15d5b$66ae9890$c400a8c0@ericlaptop> ejones17 at austin.rr.com (eric jones) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 13:46:08 None <DGEHJAIMAIENCEHBIHCJEEJFCIAA.altis@semi-retir... I'm not from the camp of "Excel bad, Command l... 3176
<Pine.LNX.4.33L2.0110251154480.26035-100000@oliphant.ee.byu.edu> oliphant at ee.byu.edu (Travis Oliphant) [SciPy-dev] Reading in arrays. 2001-10-25 16:03:25 <200110251607.f9PG7E526198@scipy.org> None As eric pointed out reading arrays of ASCII da... 2289
<DGEHJAIMAIENCEHBIHCJCEHHCJAA.altis@semi-retired.com> altis at semi-retired.com (Kevin Altis) [SciPy-dev] SciPy versus Matlab, Excel, and ot... 2001-10-25 16:08:00 <87vgh3lvh8.fsf@jeeves.blindglobe.net> None Don't shoot the messenger. :) My purpose in br... 3880
<00f501c15d82$124380f0$c400a8c0@ericlaptop> eric at scipy.org (eric jones) [SciPy-dev] Reading in arrays. 2001-10-25 18:22:57 None <Pine.LNX.4.33L2.0110251154480.26035-100000@ol... > As eric pointed out reading arrays of ASCII ... 4142
... ... ... ... ... ... ...

20405 rows × 7 columns


In [42]:
G = nx.Graph(list(pairs)[:5])

In [43]:
nx.draw_spring(G)



In [44]:
nx.draw(G)



In [45]:
nx.degree_assortativity_coefficient(G)


Out[45]:
1.0000000000000022

In [46]:
nx.average_neighbor_degree(G)


Out[46]:
{'aisaac@american.... (Alan G Isaac)': 1.0,
 'charlesr.harris@gmail.... (Charles R Harris)': 1.0,
 'fperez.net@gmail.... (Fernando Perez)': 1.0,
 'lists@hilboll... (Andreas H.)': 1.0,
 'luethi@vaw.baug.ethz... (Martin =?ISO-8859-1?Q?L=FCthi?=)': 1.0,
 'michael.abshoff@googlemail.... (Michael Abshoff)': 1.0,
 'nils106@googlemail.... (Nils Wagner)': 1.0,
 'ralf.gommers@gmail.... (Ralf Gommers)': 1.0,
 'rob.clewley@gmail.... (Rob Clewley)': 1.0}

In [47]:
a = nx.average_degree_connectivity(G)

In [48]:
plt.scatter(a.keys(), a.values())


Out[48]:
<matplotlib.collections.PathCollection at 0x10b7f7690>

In [49]:
arx.data['From']


Out[49]:
Message-ID
<NEBBIECAMLMAAKHEGPCGKEBHCLAA.travis@vaught.net>                     travis at vaught.net (Travis N. Vaught)
<Pine.LNX.4.33.0107231957590.15960-100000@oliphant.ee.byu.edu>        oliphant at ee.byu.edu (Travis Oliphant)
<87snfmn8o1.fsf@jeeves.blindglobe.net>                              rossini at blindglobe.net (A.J. Rossini)
<Pine.LNX.4.33.0107240041500.16169-100000@oliphant.ee.byu.edu>        oliphant at ee.byu.edu (Travis Oliphant)
<87itgisti1.fsf@jeeves.blindglobe.net>                              rossini at blindglobe.net (A.J. Rossini)
<20010725005337.PHSJ11181.tomts14-srv.bellnexxia.net@localhost>     tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey)
<20010725034411.SATM11181.tomts14-srv.bellnexxia.net@localhost>     tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey)
<20010801231139.YWP4521.tomts13-srv.bellnexxia.net@localhost>     tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey)
<024e01c11b4e$f5510050$7600a8c0@ericlaptop>                                   eric at scipy.org (eric jones)
<Pine.LNX.4.33.0108021551360.838-100000@sugar.ee.byu.edu>           oliphant at ee.byu.edu (Travis Oliphant)
<00e601c11bc5$0dbea630$c300a8c0@ericlaptop>                                   eric at scipy.org (eric jones)
<200108031856.OAA29753@mud.cgl.uwaterloo.ca>                     tjlahey at mud.cgl.uwaterloo.ca (Tim Lahey)
<005101c12a92$ce8e2b20$ec88fea9@ericlaptop>                                   eric at scipy.org (eric jones)
<87u1z1j90m.fsf@phantom.ecn.uiowa.edu>                          jmr at engineering.uiowa.edu (Joe Reinhardt)
<009f01c12aaa$3bee89a0$ec88fea9@ericlaptop>                                   eric at scipy.org (eric jones)
...
<A94D38D1-8C47-4AFD-AC81-6E9B14B1C208@gmail.com>                             nonhermitian@gmail.... (Paul Nation)
<5CCED7DF-FEFE-4DE8-804D-FDBDDCFFC655@gmail.com>                             nonhermitian@gmail.... (Paul Nation)
<CAOyNq8pohpyH4tK+7_r+99mAKeBjd2XD_JQd68XzifopY4MJFg@mail.gmail.com>          benny.malengier@gmail.... (Benny Malengier)
<CAFzxpWot3vWhu-e--kgOfcqA2uATzVdo4bggggFxZD7z2g4+DA@mail.gmail.com>               ghisvail@gmail.... (Ghislain Vaillant)
<CAFzxpWr=ghSvq5k9oPc_M7z97hxP9w-166wboFrbXAzHcsFErg@mail.gmail.com>               ghisvail@gmail.... (Ghislain Vaillant)
<E85DC9E7-485A-4002-8A9D-33364C5D7D48@gmail.com>                             nonhermitian@gmail.... (Paul Nation)
<DUB130-W833D7C61B8425CA52C9DFFF5100@phx.gbl>                                n59_ru@hotmail.... (Nikolay Mayorov)
<B609C13C-5F70-4B17-82A1-7449015DA586@gmail.com>                           charlesnwoods@gmail.... (Nathan Woods)
<CAGeA38nqwV=0heo5rDEjDssspOwtCr-Hn23VqwhAk1SQVeQgsA@mail.gmail.com>                  ewm@redtetrahedron.... (Eric Moore)
<CAMRo0iushKQWTU6k5CxAMpZz3XnJrWa=gUPgRkNyG2+fjGu8Cg@mail.gmail.com>         evgeny.burovskiy@gmail.... (Evgeni Burovski)
<CAPJVwBkJWBQcQpO=R0AnZZMg2Ab3iA6ZcHrt91JyLdvSLcnrMQ@mail.gmail.com>                      njs@pobox.... (Nathaniel Smith)
<CABL7CQgmMW--K2XMgXfuf_a-bPRgx58QaDghXvXoGH0iWs5HEA@mail.gmail.com>                ralf.gommers@gmail.... (Ralf Gommers)
<CAOQtOH3d+19qf5ux4fhRgS+4YyvXeF3=S_MXYhXA9azOBU983g@mail.gmail.com>            freddyrietdijk@fridh... (Freddy Rietdijk)
<DUB130-W34283B579FB0ABFFED1DD3F5110@phx.gbl>                                n59_ru@hotmail.... (Nikolay Mayorov)
<20030331115631.8C3D23EB09@www.scipy.com>                       mariam_y2k_abacha at yahoo.com (ALHAJI MOHAMME...
Name: From, Length: 20405, dtype: object

In [50]:
people_list = list(set(arx.data['From']))

In [51]:
people_list


Out[51]:
['mattknox_ca at hotmail.com (Matt Knox)',
 'e.antero.tammi@gmail.... (eat)',
 'david.grant at telus.net (David Grant)',
 'dwf@cs.toronto.... (David Warde-Farley)',
 'alan@ajackson.... (Alan Jackson)',
 'support at microsoft.com (support at microsoft.com)',
 'aisaac at american.edu (Alan G Isaac)',
 'skybar2003 at hotmail.com (skybar2003 at hotmail.com)',
 'opoku at ece.cmu.edu (Osei Poku)',
 'deshpande.jaidev@gmail.... (Jaidev Deshpande)',
 'david at ar.media.kyoto-u.ac.jp (David Cournapeau)',
 'scipy@SamuelJohn... (Samuel John)',
 'kroeger.niklas@gmail.... (=?ISO-8859-1?Q?Niklas_Kr=F6ger?=)',
 'Brian.Newsom@Colorado.... (Brian Lee Newsom)',
 'jpscipy@gmail.... (J P)',
 'ozancag@gmail.... (=?UTF-8?B?T3phbiDDh2HEn2xheWFu?=)',
 'general.mooney@googlemail.... (=?ISO-8859-1?Q?Ciar=E1n_Mooney?=)',
 'bkomaki@yahoo.... (Bairam Komaki)',
 'return at trafficmagnet.net (Christine Hall)',
 'pierre.haessig@crans.... (Pierre Haessig)',
 'gilles.rochefort@gmail.... (Gilles Rochefort)',
 'jan_braun at gmx.net (Jan Braun)',
 'jjstickel@vcn.... (Jonathan Stickel)',
 'zufus at zufus.org (Marco Presi)',
 'clee at spiralis.merseine.nu (clee at spiralis.merseine.nu)',
 'samtygier@yahoo.co... (sam tygier)',
 'dkaufman at imago.com (Duane Kaufman)',
 'charles.harris at sdl.usu.edu (Chuck Harris)',
 'twentypoundtrout@yahoo.... (Nate)',
 'hgamboa at gmail.com (Hugo Gamboa)',
 'roberto.bucher@supsi... (Roberto Bucher)',
 'steve@shrogers.... (Steven H. Rogers)',
 'jrocher@enthought.... (Jonathan Rocher)',
 'razimantv@gmail.... (Raziman T V)',
 'ellisonbg.net at gmail.com (Brian Granger)',
 'dd55@cornell.... (Darren Dale)',
 'peterson at math.utwente.nl (Pearu Peterson)',
 'anandps20@gmail.... (anand parthasarathy)',
 'cweng at cwnetdg.io (Engineering Staff)',
 'pnmiller at pacbell.net (Pat Miller)',
 'listservs at mac.com (listservs at mac.com)',
 'luke.pfister@gmail.... (Luke Pfister)',
 'zw4131@gmail.... (=?GB2312?B?va20886w?=)',
 'travis at enthought.com (Travis N. Vaught)',
 'lists at UltimateG.com (Mark Evans)',
 'gregg@renesys.... (Gregg Lind)',
 'europax at home.com (Rob)',
 'koepsell@gmail.... (killian koepsell)',
 'laytonjb at bellsouth.net (Jeff Layton)',
 'maniteja.modesty067@gmail.... (Maniteja Nandana)',
 'jason at tishler.net (Jason Tishler)',
 'david.kirkby@onetel.... (David Kirkby)',
 'Marketinges at eyou.com (Mailserver)',
 'eric at enthought.com (Eric)',
 'ndarray at mac.com (ndarray at mac.com)',
 'lukashev.s@ya... (=?koi8-r?B?7NXLwdvF1yDzxdLHxco=?=)',
 'timcera@earthlink.... (Tim Cera)',
 'izakmarais at yahoo.com (izak marais)',
 'samnemo at gmail.com (sam n)',
 'claes.hagstrom at swipnet.se (=?utf-8?Q?Claes_Hagstr=C3=B6m?=)',
 'arstone208@gmail.... (Adam Stone)',
 'stewie.hannah@gmail.... (stuart)',
 'niels.ellegaard@gmail.... (Niels L. Ellegaard)',
 'curtis at lpl.arizona.edu (Curtis Cooper)',
 'charlesr.harris at gmail.com (Charles R Harris)',
 'fultz at cs.purdue.edu (Charles Fultz)',
 'warren.weckesser@gmail.... (Warren Weckesser)',
 'nwerneck@gmail.... (Nicolau Werneck)',
 'tournesol33@gmail.... (tournesol)',
 'bussonniermatthias@gmail.... (Matthias Bussonnier)',
 'Jiri.Krtek@rsj.... (Jiri Krtek)',
 'dwzba848t at influentialdelight.com (Luisa England)',
 'j_r_fonseca at yahoo.co.uk (=?iso-8859-1?Q?Jos=E9?= Fonseca)',
 'denis@laxalde.... (Denis Laxalde)',
 'jameskyle@ucla.... (James Kyle)',
 'burley@zonnet... (burley)',
 'david.kremer.dk@gmail.... (David Kremer)',
 'olivier.grisel@ensta.... (Olivier Grisel)',
 'flyeng4@gmail.... (William Purcell)',
 'dave.hirschfeld@gmail.... (Dave Hirschfeld)',
 'bhendrix at enthought.com (Bryce Hendrix)',
 'vanforeest@gmail.... (nicky van foreest)',
 'jwilson556@gmail.... (James Wilson)',
 'jabooth@gmail.... (James Booth)',
 'matthias@matthias-k.... (Matthias =?ISO-8859-1?Q?K=FCmmerer?=)',
 'chris at trichech.us (Christopher Fonnesbeck)',
 'benny.malengier@gmail.... (Benny Malengier)',
 'jwp at cns.nyu.edu (Jon Peirce)',
 'andrew.smirnov@gmail.... (Andrey Smirnov)',
 'dagss@student.matnat.uio... (Dag Sverre Seljebotn)',
 'warren.weckesser@enthought.... (Warren Weckesser)',
 'zbyszek@in.waw... (Zbyszek Szmek)',
 'y9bz4p4zaix7 at yahoo.com (Dee Young)',
 'jre@enthought.... (J. Ryan Earl)',
 'berthold.hoellmann@gl-group.... (=?iso-8859-15?Q?Berthold_H=F6llmann?=)',
 'Robert_Pickle@brown.... (Robert Pickle)',
 'Vincent.Virgilio at itt.com (Virgilio, Vincent)',
 'yangofzeal@gmail.... (Michael Yang)',
 'anubhab91@gmail.... (Anubhab Baksi)',
 'johannesbroedel at gmx.de (Johannes Broedel)',
 'claus@maths.lth... (Claus =?utf-8?b?RsO8aHJlcg==?=)',
 'ralf.gommers@googlemail.... (Ralf Gommers)',
 'mohitiiit@gmail.... (Mohit Goyal)',
 'ckkart@hoc.... (Christian K.)',
 'Kasper.Souren at ircam.fr (Kasper Souren)',
 'aisaac@american.... (Alan Isaac)',
 'lars@voxdahl.... (Lars Voxen Hansen)',
 'berthold@despammed.... (Berthold =?iso-8859-15?Q?H=F6llmann?=)',
 'prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
 'annapowellsmith@gmail.... (Anna Powell-Smith)',
 'stefan.czesla@hs.uni-hamburg... (Stefan)',
 'sdpan21@gmail.... (dp docs)',
 'nwagner at iam.uni-stuttgart.de (Nils Wagner)',
 'oliphant.travis@ieee.... (Travis Oliphant)',
 'alexandre.fayolle@logilab... (Alexandre Fayolle)',
 'pearu at cens.ioc.ee (Pearu)',
 'efiring@hawaii.... (Eric Firing)',
 'mws@lionex... (Maximilian Singh)',
 'l80w2dc2hd at aol.com (Susan Delong)',
 'josh.k.lawrence@gmail.... (Josh Lawrence)',
 'ssclift@gmail.... (Simon Clift)',
 'timmichelsen@gmx-topma... (timmichelsen@gmx-topma...)',
 'huw_lynch@yahoo.... (Huw Lynch)',
 'angel.yanguas@gmail.... (Angel Yanguas-Gil)',
 'strang at nmr.mgh.harvard.edu (Gary Strangman)',
 'ajo.fod@gmail.... (Ajo Fod)',
 'casperskovby@gmail.... (Casper Skovby)',
 'ccasey@enthought.... (Chris Casey)',
 'bsouthey at gmail.com (Bruce Southey)',
 'dahl.joachim@gmail.... (Joachim Dahl)',
 'scopatz@gmail.... (Anthony Scopatz)',
 'nicole.haenni@gmail.... (Nicole Haenni)',
 'eraldo.pomponi@gmail.... (Eraldo Pomponi)',
 'alan.isaac@gmail.... (Alan G Isaac)',
 'tbach at bu.edu (Ted Bach)',
 'jh at comunit.de (Janko)',
 'nbecker at fred.net (nbecker at fred.net)',
 'opossumnano@gmail.... (Tiziano Zito)',
 'as8ca at virginia.edu (Alok Singhal)',
 'raniere@ime.unicamp... (Raniere Silva)',
 'denis.laxalde@mcgill... (Denis Laxalde)',
 'nicopernetty at yahoo.fr (Nicolas Pernetty)',
 'xabart@gmail.... (Xavier Barthelemy)',
 'joshua.m.grant@gmail.... (Joshua Grant)',
 'd.s.seljebotn@astro.uio... (Dag Sverre Seljebotn)',
 'svenbrauch@googlemail.... (Sven Brauch)',
 'swisher at enthought.com (Janet Swisher)',
 'z1z2 at sympatico.ca (MG Publishing)',
 'kyle.mandli@gmail.... (Kyle Mandli)',
 'ceball@gmail.... (Chris Ball)',
 'hagberg at lanl.gov (Aric Hagberg)',
 'sarms@unidata.ucar.... (Sean Arms)',
 'c.schmidt-hieber@ucl.ac... (Christoph Schmidt-Hieber)',
 'stephen.walton at csun.edu (Stephen Walton)',
 'luthi at gi.alaska.edu (=?iso-8859-1?q?Martin_L=FCthi?=)',
 'have at nice.day (John Williams)',
 'jenny.stone125@gmail.... (Jennifer stone)',
 'jstrunk at enthought.com (Jeff Strunk)',
 'nkilleen@unimelb.edu... (Neil Killeen)',
 'rng7 at cornell.edu (Ryan Gutenkunst)',
 'lists@onerussian.... (Yaroslav Halchenko)',
 'brian.lee.hawthorne@gmail.... (Brian Hawthorne)',
 'berthold.hoellmann@gl-group.... (=?iso-8859-15?Q?Berthold_=22H=F6llmann=22?=)',
 'dpeterson@enthought.... (Dave Peterson)',
 'oliphant@ee.by... (oliphant@ee.by...)',
 'oliphant at ee.byu.edu (Travis Oliphant)',
 'bigorneault@gmail.... (=?ISO-8859-1?Q?Th=E9lesphonse_Bigorneault?=)',
 'howarth@bromo.med.uc.... (Jack Howarth)',
 'padarn@gmail.... (Padarn Wilson)',
 'paustin at eos.ubc.ca (Philip Austin)',
 'pebarrett at gmail.com (Paul Barrett)',
 'virgilio at ieee.org (Vincent N. Virgilio)',
 'travis@vaught.... (Travis Vaught)',
 'david@ar.media.kyoto-u.ac... (David Cournapeau)',
 'eugeneai@irnok.... (=?UTF-8?B?0JXQstCz0LXQvdC40Lkg0KfQtdGA0LrQsNGI0LjQvQ==?=)',
 'collinstocks@gmai... (collinstocks@gmai...)',
 'grogers at enthought.com (grogers)',
 'patmiller at llnl.gov (Pat Miller)',
 'gruben at bigpond.net.au (Gary Ruben)',
 'roberto.bucher@supsi... (SUPSI)',
 'alexandre.gramfort@inria... (Alexandre Gramfort)',
 'Fernando.Perez at colorado.edu (Fernando.Perez at colorado.edu)',
 'gkclri@yahoo.... (Gopalakrishnan Ravimohan)',
 'gvrooyen@gmail.... (G-J van Rooyen)',
 'eric at scipy.org (eric jones)',
 'xavier.gnata@gmai... (xavier.gnata@gmai...)',
 'phaustin@gmail.... (Phil Austin)',
 'drazen.lucanin@gmail.... (=?ISO-8859-2?Q?Dra=BEen_Lu=E8anin?=)',
 'adam at enthought.com (Adam Barker)',
 'john.stachurski@gmail.... (John Stachurski)',
 'freddyrietdijk@fridh... (Freddy Rietdijk)',
 'P.Schellart@astro.ru... (Pim Schellart)',
 'kwgoodman@gmail.... (Keith Goodman)',
 'anand.prabhakar.patil@gmail.... (anand.prabhakar.patil)',
 'sartout at math.u-strasbg.fr (Alain Sartout)',
 'stefan.otte@gmail.... (Stefan Otte)',
 'chris at fisher.forestry.uga.edu (Christopher Fonnesbeck)',
 'jeremit0@gmail.... (Jeremy Conlin)',
 'david.douard at logilab.fr (David Douard)',
 'i.joung@gmail.... (InSuk Joung)',
 'illoul_lounes@yahoo... (illoul lounes)',
 'thouis@gmail.... (Thouis (Ray) Jones)',
 'kern at ugcs.caltech.edu (Robert Kern)',
 'fperez.net@gmail.... (Fernando Perez)',
 'deil.christoph@googlemail.... (Christoph Deil)',
 'Chris.Barker at noaa.gov (Chris Barker)',
 'moritz.beber@gmail.... (Moritz Emanuel Beber)',
 'cjordan1@uw.... (Christopher Jordan-Squire)',
 'arnar.flatberg@gmail.... (Arnar Flatberg)',
 'lopmart@gmail.... (jose luis Lopez Martinez)',
 'hanu_man_ji@hotmail.... (jai hanuman)',
 'hodge@stsci.... (Phil Hodge)',
 'gaston.fiore@gmail.... (Gaston Fiore)',
 'vanderplas@astro.washington.... (Jacob VanderPlas)',
 'cohen@lpta.in2p3... (Johann Cohen-Tanugi)',
 'halish@kofeina.... (Mateusz Haligowski)',
 'arokem@berkeley.... (Ariel Rokem)',
 'miguel-py@astro.su... (Miguel de Val-Borro)',
 'gazzar at email.com (Gary Ruben)',
 'bounce at trafficmagnet.com (Sarah Williams)',
 'email.administrator at knightfrank.com (email.administrator at knightfrank.com)',
 'joel.schaerer@insa-lyon... (Joel Schaerer)',
 'a.schmolck at gmx.net (A.Schmolck)',
 'wbuckner@beatsmusic.... (Will Buckner)',
 'pgmdevlist at mailcan.com (Pierre GM)',
 'msyang@alumni.princeton.... (Michael Yang)',
 'v-nijs@kellogg.northwestern.... (Vincent Nijs)',
 'Norbert.Nemec.list@gmx... (Norbert Nemec)',
 'rstanchak at yahoo.com (Roman Stanchak)',
 'couge.chen@gmail.... (Couge Chen)',
 'david at dwavesys.com (David Grant)',
 'nmb at wartburg.edu (Neil Martinsen-Burrell)',
 'a.h.jaffe at gmail.com (Andrew Jaffe)',
 'patperry at stanford.edu (Patrick Perry)',
 'jhauser at ifm.uni-kiel.de (janko hauser)',
 'roybryant at seventwentyfour.com (Roy at SEVENtwentyfour Inc.)',
 'matthew.brett at gmail.com (Matthew Brett)',
 'martin.teichmann@lkb.ens... (Martin Teichmann)',
 'nerduno.list@gmail.... (Aaron Andalman)',
 'cwebster@enthought.... (Corran Webster)',
 'guyer@nist.... (Jonathan Guyer)',
 'ejones17 at austin.rr.com (eric jones)',
 'trobinson@systemsbiology.... (Thomas Robinson)',
 'hazelnusse@gmail.... (Luke)',
 'Doug.LATORNELL at mdsinc.com (LATORNELL, Doug)',
 'jingw51 at hotmail.com (wu jing)',
 'cimrman3@ntc.zcu... (Robert Cimrman)',
 'carlson3 at aol.com (Della Mead)',
 'charles.harris at sdl.usu.edu (Charles R Harris)',
 'emanuele@relativita.... (Emanuele Olivetti)',
 'luszczek@eecs.utk.... (Piotr Luszczek)',
 'clarkfitzg@gmail.... (Clark Fitzgerald)',
 'eric at enthought.com (eric_imap)',
 'jelle.feringa at ezct.net (Jelle Feringa // EZCT / Amsterdam)',
 'magnus at thinkware.se (magnus at thinkware.se)',
 'mauger@physics.ucdavis.... (Matthew Auger)',
 'dsweeney@scimentis.... (Deacon Sweeney)',
 'nickel@dbs.ifi.lmu... (Maximilian Nickel)',
 'sales at smoking.com.net (Sales Department)',
 'skip at pobox.com (Skip Montanaro)',
 'fperez.net at gmail.com (Fernando Perez)',
 'mailstop at melcon-c.com (mailstop at melcon-c.com)',
 'dominique.orban@gmail.... (Dominique Orban)',
 'sturla@molden... (Sturla Molden)',
 'm.boumans@gmx.... (Marcus Boumans)',
 'kdavies4@gmail.... (Kevin Davies)',
 'piet@vanoostrum.... (Piet van Oostrum)',
 'fredrik.johansson@gmail.... (Fredrik Johansson)',
 'dickrp at wckn.com (Robert Dick)',
 'akinoame1 at gmail.com (Denis Simakov)',
 'victor.dvro@gmail.... (Dayvid Victor)',
 'dlenski@gmail.... (Dan Lenski)',
 'ks7uqzz25 at imaginationcostume.com (Everette Butts)',
 'jsilter@gmail.... (Jacob Silterra)',
 'pwang at enthought.com (Peter Wang)',
 'andreas.frankfurt at gmx.de (Andreas Lorenz)',
 'w.northcott at unsw.edu.au (Bill Northcott)',
 'falted at openlc.org (Francesc Alted)',
 'crwe@post... (=?us-ascii?Q?crwe=20crwe?=)',
 'mandrews at no-moreaccent.com (Martin Andrews)',
 'carlos.s.santos@gmail.... (Carlos da Silva Santos)',
 'heiko at hhenkelmann.de (Heiko Henkelmann)',
 'tim@cerazone.... (Tim Cera)',
 'prabhu_r at users.sf.net (Prabhu Ramachandran)',
 'ceball@users.sourceforge.... (C. Ball)',
 'bgoli at sun.ac.za (Brett Olivier)',
 'dalcinl@gmail.... (Lisandro Dalcin)',
 'kathy.phillips at mail.internetseer.com (Kathy Phillips)',
 'ryanlists at gmail.com (Ryan Krauss)',
 'arkapravobhaumik@gmail.... (Arkapravo Bhaumik)',
 'loredo at astrosun.astro.cornell.edu (Tom Loredo)',
 'jesusfritz_89 at whois.sc (Jesus Fritz)',
 'fperez at pizero.colorado.edu (Fernando Perez)',
 'golux at comcast.net (Stephen Waterbury)',
 'fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
 'jsw at cdc.noaa.gov (Jeff Whitaker)',
 'dmitrey15@ukr.... (Dmitrey)',
 'southey at ux1.cso.uiuc.edu (southey at ux1.cso.uiuc.edu)',
 'ndel314@gmail.... (Nico Del Piano)',
 's3lmtepn9iq at aol.com (Lynn Khan)',
 'skorpio11@gmail.... (Leon Adams)',
 'motocng at cq114.com.cn (Ms.huang)',
 'mpan@cesga... (Miguel Pan Fidalgo)',
 'nmb@wartburg.... (Neil Martinsen-Burrell)',
 'thomas.haslwanter@alumni.ethz... (Thomas Haslwanter)',
 'newville@cars.uchicago.... (Matt Newville)',
 'jba@sdf.lonestar.... (Jeffrey Armstrong)',
 'tanner@gmx... (Thomas Tanner)',
 'amcmorl@gmail.... (Angus McMorland)',
 'hardbyte@gmail.... (Brian Thorne)',
 'astrofitz@gmail.... (Michael Fitzgerald)',
 'rex@nosyntax.... (rex)',
 'bud at sistema.it (Bud P.Bruegger)',
 'Norbert.Nemec.List@gmx... (Norbert Nemec)',
 'joel.andersson@esat.kuleuven... (Joel Andersson)',
 'gregwillden at gmail.com (Greg Willden)',
 'cohen@slac.stanford.... (Johann Cohen-Tanugi)',
 'tteststudent@gmail.... (theodore test)',
 'david.huard at gmail.com (David Huard)',
 'a.u.r.e.l.i.a.n at gmx.net (=?ISO-8859-1?Q?=22Johannes_L=F6hnert=22?=)',
 'ygxulz920 at coolnclassy.com (Wilfred Campbell)',
 'johann.cohentanugi@gmail.... (Johann cohen-tanugi)',
 'sturla.molden@gmail.... (Sturla Molden)',
 'rmcgibbo@gmail.... (Robert McGibbon)',
 'ramercer@gmail.... (Adam Mercer)',
 'mxhf@gmx.... (Maximilian Fabricius)',
 'chris at pseudogreen.org (Christopher Stawarz)',
 'jdhunter at ace.bsd.uchicago.edu (John Hunter)',
 'nathan.faggian@gmail.... (Nathan Faggian)',
 'bravo.loic@gmail.... (=?UTF-8?B?TG/Dr2M=?= Berthe)',
 'pinto@mit.... (Nicolas Pinto)',
 'cristeab@gmail.... (Bogdan Cristea)',
 'ndarray at mac.com (Sasha)',
 '14772361@sun.ac... (Hamman, RA, Mr <14772361@sun.ac.za>)',
 'klrkdekira@gmail.... (CL Chow)',
 'JDM@MarchRay.... (Jonathan March)',
 'bhoel at web.de (Berthold =?iso-8859-15?q?H=F6llmann?=)',
 'vanderplas@astro.washington.... (Jake Vanderplas)',
 'python at lee-morgan.net (python at lee-morgan.net)',
 'cookedm at physics.mcmaster.ca (David M.Cooke)',
 'Nicolas.Rougier@loria... (Nicolas Rougier)',
 'roban@astro.columbia.... (Roban Kramer)',
 'h5d2l3v2q at imagination-unlimited.com (Vivian Cleveland)',
 'pyzun18 at yahoo.com.au (=?iso-8859-1?q?Johnny=20boi?=)',
 'thomas@kluyver.me... (Thomas Kluyver)',
 'cgalvan@enthought.... (Christopher Galvan)',
 'dfarning@sugarlabs.... (David Farning)',
 'luethi@vaw.baug.ethz... (Martin =?ISO-8859-1?Q?L=FCthi?=)',
 'ryanlists@gmail.... (Ryan Krauss)',
 'chris at fisher.forestry.uga.edu (chris at fisher.forestry.uga.edu)',
 'erik.tollerud@gmail.... (Erik Tollerud)',
 'ralf.gommers@gmail.... (Ralf Gommers)',
 'swisher at enthought.com (Janet M. Swisher)',
 'pearu.peterson@gmai... (pearu.peterson@gmai...)',
 'matt.terry@gmail.... (Matt Terry)',
 'guyer at nist.gov (Jonathan Guyer)',
 'jensj at fysik.dtu.dk (Jens =?ISO-8859-1?Q?J=F8rgen?= Mortensen)',
 'strawman@astraw.... (Andrew Straw)',
 'n59_ru@hotmail.... (Nikolay Mayorov)',
 'a.u.r.e.l.i.a.n at gmx.net (aurelian)',
 'eric@enthought.... (eric)',
 'jonathan.guyer@nist.... (Guyer, Jonathan E. Dr.)',
 'faltet@carabos.... (Francesc Altet)',
 'vincent@vincentdavis.... (Vincent Davis)',
 'twood@doc.ic.ac... (Tobias Wood)',
 'michauxkelley@gmail.... (Michaux Kelley)',
 'rlucente@pipeline.... (Robert Lucente)',
 'martyfuhry@gmail.... (Marty Fuhry)',
 'neilcrighton@gmail.... (Neil Crighton)',
 'jrennie@gmail.... (Jason Rennie)',
 'jb.richet at areas.asso.fr (Jean-Baptiste Richet - AREAS)',
 'rcsqtc@iqac.csic... (Ramon Crehuet)',
 'j_r_fonseca at yahoo.co.uk (=?iso-8859-15?Q?Jos=E9?= Fonseca)',
 'arvind.sbia@gmail.... (Arvind Rao)',
 'jstevenson131@gmail.... (Jacob Stevenson)',
 'ashwinsrnth@gmail.... (Ashwin Srinath)',
 'Giovanni.Samaey at cs.kuleuven.ac.be (Giovanni Samaey)',
 'ce@vejnar... (Charles Vejnar)',
 'pnmiller at pacbell.net (Patrick Miller)',
 'berkes@gatsby.ucl.ac... (Pietro Berkes)',
 'aarchiba@physics.mcgill... (Anne Archibald)',
 'vatan.uml@gmail.... (vattan)',
 'aia8v@virginia.... (Alexander Arsenovic)',
 'scott.sinclair.za@gmail.... (Scott Sinclair)',
 'mpeti_ka06 at redffmail.com (MPETI L.KABILA (Jnr))',
 'skybar2002 at hotmail.com (skybar2002 at hotmail.com)',
 'yw5aj@virginia.... (Yuxiang Wang)',
 'nils106@googlemail.... (Nils Wagner)',
 'rossini at u.washington.edu (Anthony Rossini)',
 'pablo.winant@gmail.... (Pablo Winant)',
 'ilanschnell@gmail.... (Ilan Schnell)',
 'ghisvail@gmail.... (Ghislain Vaillant)',
 'Miroslav.Houdek@es... (Miroslav.Houdek@es...)',
 'dalcinl at gmail.com (Lisandro Dalcin)',
 'jason-sage@creativetra... (jason-sage@creativetra...)',
 'ariver@enthought.... (Aaron River)',
 'lists@cheimes... (Christian Heimes)',
 'cevans@evanslabs.... (Constantine Evans)',
 'joerg@joergdietrich.... (Joerg Dietrich)',
 'toddrjen@gmail.... (Todd)',
 'saintmlx@apstat.... (saintmlx)',
 'Dwf@cs.toronto.... (David Warde-Farley)',
 'imakaev@mit.... (Maxim Imakaev)',
 'it@enthought.... (Aaron River via Support)',
 'fabian.pedregosa@inria... (Fabian Pedregosa)',
 'pgmdevlist at gmail.com (Pierre GM)',
 'matt@plot... (Matt Sundquist)',
 'ulrik@42degreesoffreedom.... (=?ISO-8859-1?Q?Ulrik_G=FCnther?=)',
 'karol.langner at kn.pl (Karol Langner)',
 'rob.clewley@gmail.... (Rob Clewley)',
 'jmt@twilley.... (Jack Twilley)',
 'rmay31@gmail.... (Ryan May)',
 'bradford.n.cross@gmail.... (Bradford Cross)',
 'vladimir.rdguez@gmail.... (=?ISO-8859-1?Q?Vlad=EDmir_Rodr=EDguez?=)',
 'derek at uni-sw.gwdg.de (Derek Homeier)',
 'martin.hoefling at gmx.de (Martin =?utf-8?q?H=C3=B6fling?=)',
 'rkramer@systemsbiology.... (Roger Kramer)',
 'leek@broadinstitute.... (Lee Kamentsky)',
 'jesper.friis at material.ntnu.no (Jesper Friis)',
 'blake.a.griffith@gmail.... (Blake Griffith)',
 'chadfulton@gmail.... (Chad Fulton)',
 'eigenspaces@gmail.... (Patrick "Kai" Baker)',
 'tufkittycat at yahoo.com (kittycat tuff)',
 'adamadamadamamiadam at gmail.com (CakeProphet)',
 'adrian.prw@gmail.... (Adrian)',
 'gombiuda@gmail.... (gombiuda JHL)',
 'otttr440 at student.liu.se (Otto Tronarp)',
 'alan.mcintyre@gmail.... (Alan McIntyre)',
 'michaeljgruber+scipy at fastmail.fm (Michael J Gruber)',
 'seefeld@sympatico... (Stefan Seefeld)',
 'finnefro at sas.upenn.edu (Adam C. Finnefrock)',
 'asfandyar_k at yahoo.com (Asfandyar Khan)',
 'bouloumag@gmail.... (Darcoux Christine)',
 'val at vtek.com (val)',
 "dosu004@sges.auckland.ac... (David O'Sullivan)",
 'gisxperts@web... (hannes)',
 'jmiller at stsci.edu (Todd Miller)',
 'jaime.frio@gmail.... (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=)',
 'chaugan at visi.com (Carl Haugan)',
 'erickt at dslextreme.com (Erick Tryzelaar)',
 'kmichael.aye@gmail.... (K.-Michael Aye)',
 'mingarelli@gmail.... (Chiara Mingarelli)',
 'dvp at mwl.MIT.EDU (Dennis V. Perepelitsa)',
 'vel.accel at gmail.com (Deiter Hering)',
 'magnus at thinkware.se (Magnus =?iso-8859-1?Q?Lyck=E5?=)',
 'bpederse@gmail.... (Brent Pedersen)',
 'rlytle at tqs.com (Lytle, Robert TQO)',
 'jhsh at sun.ac.za (Jannie Hofmeyr)',
 'brendansimons at yahoo.ca (Brendan Simons)',
 'rob at pythonemproject.com (Rob)',
 'jaakko.luttinen@aalto... (Jaakko Luttinen)',
 'mcseem at antigrain.com (Maxim Shemanarev)',
 'rnelsonchem@gmail.... (Ryan Nelson)',
 'Chris.Fonnesbeck at MyFWC.com (Fonnesbeck, Chris)',
 'fred.jen@web... (Fred Jendrzejewski)',
 'venkatraghavan at gmail.com (Venkat Raghavan V.C.)',
 'ryan@rdodesigns.... (Ryan Orendorff)',
 'Ralf_Ahlbrink at web.de (Ralf Ahlbrink)',
 'wnbell@gmail.... (Nathan Bell)',
 'jhauser at ifm.uni-kiel.de (Janko Hauser)',
 'davidmenhur@gmail.... (=?UTF-8?B?RGHPgGlk?=)',
 'saullogiovani@gmail.... (Saullo Castro)',
 'pasky@ucw... (Petr Baudis)',
 'wnbell at gmail.com (Nathan Bell)',
 'aahz@pythoncraft.... (Aahz)',
 'dd55 at cornell.edu (Darren  Dale)',
 'joscha.schmiedt@googlemail.... (Joscha Schmiedt)',
 'msyang@princeton.... (Michael Yang)',
 'theodore.goetz@gmail.... (Johann Goetz)',
 'w.f.alexander at ieee.org (Bill Alexander)',
 'answer at tnoo.net (=?iso-8859-1?q?Martin_L=FCthi?=)',
 'jtaylor.debian@googlemail.... (Julian Taylor)',
 'conradlee@gmail.... (Conrad Lee)',
 'altis at semi-retired.com (Kevin Altis)',
 'n_habili at hotmail.com (Nariman Habili)',
 'rkern at ucsd.edu (Robert Kern)',
 'dturgut@gmail.... (Deniz Turgut)',
 'lawnmowerman_0 at yahoo.com (Jun Sung)',
 'jh at oobleck.astro.cornell.edu (Joe Harrington)',
 'pierre.gueth@free... (Pierre Gueth)',
 'jh@physics.ucf.... (Joe Harrington)',
 'wingusr@gmail.... (TP)',
 'aisaac@american.... (Alan G Isaac)',
 'zpincus at stanford.edu (Zachary Pincus)',
 'nonhermitian@gmail.... (Paul Nation)',
 'nmarais@sun.ac... (Neilen Marais)',
 'mueller@imt.uni-luebeck... (=?iso-8859-1?Q?Jan_M=FCller?=)',
 'berthold@xn--hllmanns-n4a... (=?utf-8?q?Berthold_H=C3=B6llmann?=)',
 'gregc at cgl.ucsf.edu (Greg Couch)',
 'ivazquez at ivazquez.net (Ignacio Vazquez-Abrams)',
 'PHobson@Geosynte... (PHobson@Geosynte...)',
 'gillet at scripps.edu (Alexandre Gillet)',
 'jenny.stone125@gmail.... (Jennifer Janani)',
 'hadsed@gmail.... (Hadayat Seddiqi)',
 'zhangj.sdu@gmail.... (Zhang Jiang)',
 'b.telenczuk@biologie.hu-berlin... (Bartosz Telenczuk)',
 'hoytak@stat.washington.... (Hoyt Koepke)',
 'cookedm at physics.mcmaster.ca (David M. Cooke)',
 'kriehl@enthought.... (Katrina Riehl)',
 'pearu at scipy.org (Pearu)',
 'boyfarrell@gmail.... (Daniel Farrell)',
 'junkshops@gmail.... (Junkshops)',
 'ej at ee.duke.edu (Eric A. Jones)',
 'ellisonbg.net@gmail.... (Brian Granger)',
 'schmidtc@gmail.... (Charles R. Schmidt)',
 'metaperl@gmail.... (Terrence Brannon)',
 'fonnesbeck at gmail.com (Chris Fonnesbeck)',
 'forkandwait@gmail.... (fork)',
 'g.durin@inrim... (Gianfranco Durin)',
 'Norbert.Nemec.list at gmx.de (Norbert Nemec)',
 'gavishnoam@gmail.... (Noam Gavish)',
 'pearu at scipy.org (pearu at scipy.org)',
 'edwardyoon@apache.... (Edward J. Yoon)',
 'krzysztof.berniak@gmail.... (Krzysztof Berniak)',
 'amachnik@gmail.... (Adam Machnik)',
 'byrnes at bu.edu (John Byrnes)',
 'cburns@berkeley.... (Christopher Burns)',
 'travis at vaught.net (Travis N. Vaught)',
 'ggellner@uoguelph... (Gabriel Gellner)',
 'djpine@gmail.... (David Pine)',
 'steve at shrogers.com (Steven H. Rogers)',
 'z8y6p5away at yahoo.ca (Elinor Wills)',
 'delphiro at zonnet.nl (delphiro)',
 'strawman at astraw.com (Andrew Straw)',
 'roybryant at SEVENtwentyfour.com (Roy Bryant)',
 'gharras@ethz... (gharras)',
 'sans at clinton.com (Anderi Smiranov)',
 'aia8v@virginia.... (alex arsenovic)',
 'tjlahey at cgl.uwaterloo.ca (Tim Lahey)',
 'spacey-scipy-dev@lenin.... (Peter C. Norton)',
 'xmichael-k@web... (Michael =?utf-8?b?S8OkcHBsZXI=?=)',
 '00ai99@gmail.... (David Gowers)',
 'mirteney@gmail.... (Anton Litvinov)',
 'jaime.frio@gmail.... (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=)',
 'jjstickel@gmail.... (Jonathan Stickel)',
 'marlukumba at rediffmail.com (marlukumba at rediffmail.com)',
 'pebarrett@gmail.... (Paul Barrett)',
 'danielsjensen1@gmail.... (Daniel Jensen)',
 'emailharvest at email.com (emailharvest at email.com)',
 'humongo.shi@gmail.... (Hugo Shi)',
 'steven.robbins at videotron.ca (Steve M. Robbins)',
 'forrest.bao@gmail.... (Forrest Sheng Bao)',
 'post.moni@t-online... (Monika)',
 'cclarke@chrisdev.... (Christopher Clarke)',
 'larshendrikfrahm@googlemail.... (Lars-Hendrik Frahm)',
 'adam at scipy.com (Adam Barker)',
 'lanceboyle@qwest.... (Jerry)',
 'tritemio@gmail.... (Antonio)',
 'oscar.bristol@googlemail.... (Oscar Benjamin)',
 'cimrman3 at ntc.zcu.cz (Robert Cimrman)',
 'thamelry at binf.ku.dk (thamelry at binf.ku.dk)',
 'p.c.degroot@tudelft... (Pieter Cristiaan de Groot)',
 'Promotioner at eyou.com (Customer Services)',
 'kamrik at gmail.com (Mark Koudritsky)',
 'ashuang at gmail.com (Albert Huang)',
 'farmckon@gmail.... (Far McKon)',
 'sebastian.schoeller@gmail.... (Sebastian Schoeller)',
 'aric.hagberg@gmail.... (Aric Hagberg)',
 'zephyr14@gmail.... (Vlad Niculae)',
 'thomas.haslwanter@fh-linz... (Thomas Haslwanter)',
 'dmp2000 at mail.ru (Andre Alves)',
 'Ralf_Ahlbrink@web... (Ralf Ahlbrink)',
 'xavier.gnata@gmail.... (Xavier Gnata)',
 't.zito at biologie.hu-berlin.de (Tiziano Zito)',
 'derek at physast.uga.edu (Derek Homeier)',
 'raphael.leplae@ulb.ac... (Raphael Leplae)',
 'nmarais at sun.ac.za (Neilen Marais)',
 'awebster@falsecolou... (awebster@falsecolou...)',
 'nouiz@nouiz.... (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=)',
 'subha_rajk@yahoo.... (Subha Raj)',
 'lee.j.joon at gmail.com (Jae-Joon Lee)',
 'Thomas.Haslwanter@fh-linz... (Haslwanter Thomas)',
 'daniel.asenjo@gmail.... (Daniel Asenjo)',
 'gael.varoquaux@normalesup.... (Gael Varoquaux)',
 'fedor at mailandnews.com (Fedor Baart)',
 'collinstocks@gmail.... (Collin Stocks)',
 'jeffrey.b.layton at lmco.com (Jeff Layton)',
 'oliphant.travis at ieee.org (Travis E. Oliphant)',
 'clovisgo at gmail.com (Clovis Goldemberg)',
 'kamrik@gmail.... (Mark Koudritsky)',
 'Brenda8367j31 at msn.com (Brenda8367j31 at msn.com)',
 'alexh@psych.usyd.edu... (Alex Holcombe)',
 'jeff.grasty@gmail.... (Jeff Grasty)',
 'cmiller730@gmail.... (Christopher Miller)',
 'dave.hirschfeld@gmail.... (Dave)',
 'ttomecek@redhat.... (Tomas Tomecek)',
 'Andreas.Floeter@web... (Andreas =?iso-8859-1?q?Fl=F6ter?=)',
 'suryak@ieee.... (Surya Kasturi)',
 'ajvogel@tuks.co... (Adolph J. Vogel)',
 'gregor.thalhammer@gmail.... (Gregor Thalhammer)',
 'ndbecker2 at gmail.com (Neal Becker)',
 'fbreitling@aip... (Frank Breitling)',
 'Linda.Gilbert@WPAFB.AF.... (Gilbert, Linda M CTR USAF AFRL/RXOC)',
 'jstrunk@enthought.... (Jeff Strunk)',
 'gpichon@enseirb... (=?ISO-8859-1?Q?Gr=E9goire_Pichon?=)',
 'philbinj@gmail.... (James Philbin)',
 'jordigh@octave.... (=?UTF-8?Q?Jordi_Guti=C3=A9rrez_Hermoso?=)',
 'nikolai.hlubek at mailbox.tu-dresden.de (Nikolai Hlubek)',
 'cjf at fonnesbeck.net (Christopher Fonnesbeck)',
 'luis94855510@gmail.... (Luis Saavedra)',
 'eric.moscardi@sophia.inria... (moscardi)',
 'mforbes@physics.ubc... (Michael McNeil Forbes)',
 'ruediger.kessel@gmail.... (=?utf-8?b?UsO8ZGlnZXI=?= Kessel)',
 'jos_edw at mail.com (JOSEPH  EDWARD.)',
 'akshar.bhosale@gmail.... (akshar bhosale)',
 'gnchen@cortechs.... (Gennan Chen)',
 'tillsten@zedat.fu-berlin... (Till Stensitzki)',
 'bdeb@willmore... (Ben Willmore)',
 'mellerf@netvision.net... (Yosef Meller)',
 'charles.moliere@gmail.... (Charles Moliere)',
 'Adam.Raczkowski@tufts.... (AJ Raczkowski)',
 "dog at ERC.MsState.Edu (David O'Gwynn)",
 'eric at enthought.com (Eric Jones)',
 'dmorrill at austin.rr.com (David C. Morrill)',
 'meruma at rio.odn.ne.jp (=?ISO-2022-JP?B?GyRCTDVOQSVXJWwlPCVzJUgbKEI=?=)',
 'rohangoel0296@gmail.... (Rohan Goel)',
 'jvankers@gmail.... (Joris Vankerschaver)',
 'lists_ravi@lavabit.... (Ravi)',
 'andy.terrel@gmail.... (Andy Ray Terrel)',
 'robince@gmail.... (Robin)',
 'Alex.Perry at ge.com (Perry, Alexander (GE Infrastructure))',
 'adityashah30@gmail.... (Aditya Shah)',
 'targetemailextractor at btamail.net.cn (targetemailextractor at btamail.net.cn)',
 'v-nijs at kellogg.northwestern.edu (Vincent Nijs)',
 'mdroe@stsci.... (Michael Droettboom)',
 'richard9404@gmail.... (Richard Tsai)',
 'renato.francisco.amaral@gmail.... (Renato Francisco G. Amaral)',
 'swisher@enthought.... (Janet Swisher)',
 'igorsyl@gmail.... (Igor Sylvester)',
 'wbaxter@gmail.... (Bill Baxter)',
 'waller@guldbyn... (Stefan Waller)',
 'nwagner at mecha.uni-stuttgart.de (Nils Wagner)',
 'golux at comcast.net (Stephen C. Waterbury)',
 'novak at ucolick.org (Greg Novak)',
 'ryorke at telkomsa.net (Rory Yorke)',
 'kern at vomit.ugcs.caltech.edu (Robert Kern)',
 'fred.mailhot at gmail.com (Fred Mailhot)',
 'smith.daniel.br@gmail.... (Daniel Smith)',
 'mhpc.edas@gmail.... (MHPC 2013)',
 'andrea.gavana@gmail.... (Andrea Gavana)',
 'staceyl at aol.com (Weldon Crosby)',
 'david@silveregg.co... (David Cournapeau)',
 'thomas.robitaille@gmail.... (Thomas Robitaille)',
 'svr6ucxx5t at foia.com (Phoebe Hobbs)',
 'contrebasse@gmail.... (Joseph Martinot-Lagarde)',
 'bgoli at sun.ac.za (Brett G. Olivier)',
 'daniel.wheeler2@gmail.... (Daniel Wheeler)',
 'mathieu@mblondel.... (Mathieu Blondel)',
 'smattacus@gmail.... (Sean Mattingly)',
 'emmanuelle.gouillart@normalesup.... (Emmanuelle Gouillart)',
 'avi@sicortex.... (Avi Purkayastha)',
 'ndbecker2@gmail.... (Neal Becker)',
 'yversley@gmail.... (Yannick Versley)',
 'chanley@stsci.... (Christopher Hanley)',
 'krivilli at unberwoot.net (krivilli at unberwoot.net)',
 'barnard at stat.harvard.edu (John Barnard)',
 'kirklin.mcdonald@gmail.... (Kirk McDonald)',
 'asmund.hjulstad@gmail.... (=?ISO-8859-1?Q?=C5smund_Hjulstad?=)',
 'jeff at taupro.com (Jeff Rush)',
 'stefan@sun.ac... (Stefan van der Walt)',
 'mkuemmel at eso.org (Martin Kuemmel)',
 'jordens@gmail.... (=?UTF-8?Q?Robert_J=C3=B6rdens?=)',
 'luca.penasa@gmail.... (Luca Penasa)',
 'millman at berkeley.edu (Jarrod Millman)',
 'r.w.lincoln@gmail.... (Richard Lincoln)',
 'Ralf.Juengling@synopsys.... (Ralf Juengling)',
 'doutriaux1@llnl.... (=?UTF-8?Q?Charles_=D8=B3=D9=85=D9=8A=D8=B1_Doutriaux?=)',
 'rossini at blindglobe.net (A.J. Rossini)',
 'leek@broad.mit.... (Lee Kamentsky)',
 'millman@berkeley.... (Jarrod Millman)',
 'd_l_goldsmith@yahoo.... (David Goldsmith)',
 'd.warde.farley@gmail.... (David Warde-Farley)',
 'm.boumans@gmx.... (bowie_22)',
 'juanlu001@gmail.... (Juan Luis Cano)',
 'christophe.grimault at novagrid.com (christophe grimault)',
 'a.mcmorland at auckland.ac.nz (Angus McMorland)',
 'Michael.J.Culbertson at wheaton.edu (Michael Culbertson)',
 'aowoyade at popmail.com (DR. OWOYADE ADEDEJI)',
 'staceylong at aol.com (Byron Oconnor)',
 'lmk2g at mail.com (LAURENT MPETI KABILA)',
 'alexei@alexeicolin.... (Alexei Colin)',
 'denis-bz-gg@t-online... (denis)',
 'dd55 at cornell.edu (Darren Dale)',
 'robin at alldunn.com (Robin Dunn)',
 'mariam_y2k_abacha at yahoo.com (ALHAJI MOHAMMED SANI ABACHA)',
 'jeremit0 at gmail.com (Jeremy Conlin)',
 'gyromagnetic@gmail.... (Gyro Funch)',
 'stefan.peterson@rubico.... (stefan)',
 'npkuin@gmail.... (Paul Kuin)',
 'stefan at sun.ac.za (Stefan van der Walt)',
 'malkarouri@yahoo.co... (Muhammad Alkarouri)',
 'clarence at broad.mit.edu (clarence at broad.mit.edu)',
 'thomas.haslwanter@alumni.ethz... (thomash)',
 'ericq@caltech.... (Eric Quintero)',
 'wilna at sun.ac.za (Du Toit, Wilna <wilna@sun.ac.za>)',
 'joosep.pata@gmail.... (Joosep Pata)',
 'jsseabold@gmail.... (Skipper Seabold)',
 'robertlayton@gmail.... (Robert Layton)',
 'goxberry@gmail.... (Geoff Oxberry)',
 'mtrumpis@berkeley.... (M Trumpis)',
 'oliphant at ee.byu.edu (Travis E. Oliphant)',
 'cs770@york.ac... (Clare Sutherland)',
 'n.dumoulin at arverne.homelinux.org (Nicolas Dumoulin)',
 'kkwweett@hotmail... (Paul Langevin)',
 'venkatraghavan at gmail.com (Venkat Raghavan)',
 'adamrossbarker at hotmail.com (Adam Barker)',
 'Simon.Tournier@alumni.enseeiht... (simon tournier)',
 'chrisk at MIT.EDU (ChrisK)',
 'argriffi@ncsu.... (alex)',
 'a.u.r.e.l.i.a.n at gmx.net (Johannes Loehnert)',
 'derek@astro.physik.uni-goettingen... (Derek Homeier)',
 'c.khroulev@gmail.... (Constantine Khroulev)',
 'resurgo@gmail.... (Peter Clarke)',
 'ben.root@ou.... (Benjamin Root)',
 'alan@ajackso... (alan@ajackso...)',
 'bhoel at web.de (=?iso-8859-15?q?Berthold_H=F6llmann?=)',
 'oliphant@ee.byu.... (Travis Oliphant)',
 'DavidA at ActiveState.com (David Ascher)',
 'gustavo.goretkin@gmail.... (Gustavo Goretkin)',
 'martinez@isg.cs.uni-magdeburg... (Janick Martinez Esturo)',
 'james@NBN.ac... (James Dominy)',
 'pav+sp@iki... (Pauli Virtanen)',
 'joseph.martinot-lagarde@m4x.... (Joseph Martinot-Lagarde)',
 'eric@enthought.... (Eric Jones)',
 'lists@hilboll... (Andreas Hilboll)',
 'hofsaess@ifb.uni-stuttgart... (=?UTF-8?B?TWFydGluIEhvZnPDpMOf?=)',
 'charris208 at attbi.com (Charles R Harris)',
 'anton.soppelsa@igi.cnr... (Soppelsa Anton)',
 'spacey-scipy-dev@lenin.... (Peter Norton)',
 'jhn@phys.au... (Jens Nielsen)',
 'marmaduke.woodman@univ-amu... (Marmaduke Woodman)',
 'js at jeannot.org (Jean-Sebastien Roy)',
 '00ai99@gmail.... (David Gowers (kampu))',
 'gelston@doosanbabcock.... (Elston, Gareth R)',
 'jakevdp@gmail.... (Jake VanderPlas)',
 'scipy@samueljohn... (Samuel John)',
 'jarausch@igpm.rwth-aachen... (Helmut Jarausch)',
 'josegomez at gmx.net (=?iso-8859-15?q?Jos=E9=20Luis=20G=F3mez=20Dans?=)',
 'hoytak@cs.ubc... (Hoyt Koepke)',
 'punchagan@gmail.... (Puneeth Chaganti)',
 'neyhmor@gmail.... (N H)',
 'wcardoen@gmail.... (Wim R. Cardoen)',
 'schaouette@free... (Gilles G.)',
 'doutriaux1@llnl.... (Charles Doutriaux)',
 'vacaciones at en-orlando.com (Vacaciones So\xf1adas)',
 'M.Reimpell at tu-bs.de (Michael Reimpell)',
 'russel at appliedminds.net (Russel)',
 'cournape@gmail.... (David Cournapeau)',
 'jmr at engineering.uiowa.edu (Joe Reinhardt)',
 'mattknox_ca@hotmail.... (Matt Knox)',
 '237995 at yahoo.com (237995 at yahoo.com)',
 'lars.bittrich@googlemail.... (Lars Bittrich)',
 'emkt at jongstit.com (Ms. Prim S.)',
 'martin.druon@wanadoo... (Martin DRUON)',
 'fwereade@googlemail.... (William Reade)',
 'rramachandran at itsc.uah.edu (Rahul Ramachandran)',
 'josh.holbrook@gmail.... (Joshua Holbrook)',
 'yoshiki89@gmail.... (Yoshiki Vazquez Baeza)',
 'chris.felton@gmail.... (Christopher Felton)',
 'james.bergstra@gmail.... (James Bergstra)',
 'h.callow at elec.canterbury.ac.nz (Hayden John Callow)',
 'njs@pobox.... (Nathaniel Smith)',
 'saintmlx@apstat.... (Xavier Saint-Mleux)',
 'pwang@enthought.... (Peter Wang)',
 'p.Collard at i-net.paiko.gr (p.Collard at i-net.paiko.gr)',
 'hoel at germanlloyd.org (=?iso-8859-15?Q?Berthold_H=F6llmann?=)',
 'Fernando.Perez at colorado.edu (Fernando Perez)',
 'mhaligowski@googlemail.... (halish)',
 'david@silveregg.co... (David)',
 'freedryk at ocean.washington.edu (Jordan Dawe)',
 'harkal at sylphis3d.com (harkal at sylphis3d.com)',
 'tony@maths.lth... (Tony Stillfjord)',
 'cookedm@physics.mcmaster... (David M. Cooke)',
 'bratona@yahoo.co... (Adam Malinowski)',
 'ericfode@gmail.... (Fode)',
 'meine@informatik.uni-hamburg... (Hans Meine)',
 'mpeti_k at mail.com (Laurent Mpeti Kabila)',
 'aizvorski@gmail.... (Alexander Izvorski)',
 'alexandre.fayolle at logilab.fr (Alexandre Fayolle)',
 'fullung@gmail.... (Albert Strasheim)',
 'andrei.avk@gmail.... (AK)',
 'mcoletti@gmail.... (Mark Coletti)',
 'tpk@kraussfamily.... (Tom K.)',
 'amenity@enthought.... (Amenity Applewhite)',
 'aaaagrawal@gmail.... (Ankit Agrawal)',
 'clemens@familie-novak.... (Clemens Novak)',
 'brianhone@gmail.... (Brian Hone)',
 'jh@physics.uc... (jh@physics.uc...)',
 'tom.grydeland@gmail.... (Tom Grydeland)',
 'gokhansever@gmail.... (=?UTF-8?Q?G=C3=B6khan_Sever?=)',
 'eads@soe.ucsc.... (Damian Eads)',
 'rjsm@umich.... (ross smith)',
 'bjorn.forsman@gmail.... (=?UTF-8?Q?Bj=C3=B8rn_Forsman?=)',
 'mount.sarah@gmail.... (Sarah Mount)',
 'Robin.K.Friedrich at usa-spaceops.com (Friedrich, Robin K)',
 'pmarks at gmail.com (Patrick Marks)',
 'ravi.rajagopal@amd.... (Ravikiran Rajagopal)',
 'mario.mulansky@gmx.... (Mario Mulansky)',
 'wbaxter at gmail.com (Bill Baxter)',
 'bussonniermatthias@gmail.... (Matthias BUSSONNIER)',
 'lomegor@gmail.... (=?ISO-8859-1?Q?Sebasti=E1n_Ventura?=)',
 'salsifis at free.fr (Salsifis)',
 'vitali@hkl.hms.harvard.... (Vitali V)',
 'cfo6hwnji at yahoo.com.hk (Lemuel Lowe)',
 'robert.kern@gmail.... (Robert Kern)',
 'fedor.baart at hccnet.nl (Fedor Baart)',
 'info at kestlerfinancial.com (Kestler Financial Group)',
 'harenma at netscape.net (MANPENSA DONDO.)',
 'bart.vandereycken at cs.kuleuven.be (Bart Vandereycken)',
 'tisham@apogee.com... (Tisham Dhar)',
 '14366037@sun.ac... (Gouws, Almero <14366037@sun.ac.za>)',
 'bhoel at web.de (Berthold Hoellmann)',
 'schut@sarvision... (Vincent Schut)',
 'tinu at email.ch (tinu)',
 'scipyman at scipy.org (Scipy Manager)',
 'mcp.stras@gmail.... (Martin Campos Pinto)',
 'sebastian.walter@gmail.... (Sebastian Walter)',
 'myeates at jpl.nasa.gov (Mathew Yeates)',
 'alexlib@eng.tau.ac... (Alex Liberzon)',
 'debian@delanoe.... (Alexandre =?iso-8859-1?Q?Delano=EB?=)',
 'zufus@zufus.... (Marco Presi)',
 'matthew.brett@gmail.... (Matthew Brett)',
 'futures at casstel.net (Trump Trading Report)',
 'michael.forbes+python@gmail.... (Michael McNeil Forbes)',
 'david.douard@logilab... (David Douard)',
 'g.vanuxem at wanadoo.fr (Vanuxem =?ISO-8859-1?Q?Gr=E9gory?=)',
 'openopt@ukr.... (dmitrey)',
 'niall.moran@gmail.... (Niall Moran)',
 'jascha@stanford.... (Jascha Sohl-Dickstein)',
 'joris.vankerschaver@gmail.... (J Vankerschaver)',
 'greglandrum at earthlink.net (greg landrum)',
 'j.j.green@gmx... (J.J. Green)',
 'pmhobson@gmail.... (Paul Hobson)',
 'johann.cohentanugi@gmail.... (Johann Cohen-Tanugi)',
 'matteo@naufraghi.... (Matteo Bertini)',
 'robert.milgate@gmail.... (Robert Milgate)',
 'boyfarrell@gmai... (boyfarrell@gmai...)',
 'loredo at astro.cornell.edu (Tom Loredo)',
 'fperez at colorado.edu (Fernando Perez)',
 'scipy-dev at scipy.net (Neal D. Becker)',
 'suzanne at theportal.net (=?ISO-8859-1?Q?The Portal?=)',
 'arnd.baecker at web.de (Arnd Baecker)',
 'irvin.probst@ensta-bretagne... (Irvin Probst)',
 'josef.pktd@gmai... (josef.pktd@gmai...)',
 'r.c.bruno.andre@gmail.... (=?ISO-8859-1?Q?Bruno_Andr=E9_Rodrigues_Coelho?=)',
 'openopt@ukr.... (Dmitrey Kroshko)',
 'bluesquall@gmail.... (M J Stanway)',
 'princessjoyce at totalise.co.uk (Princess Joyce Abonime)',
 'matti.pastell@helsinki... (Matti Pastell)',
 'piesiewicz at pb.izm.fhg.de (Radoslaw Piesiewicz)',
 'rbastian@free... (R. Bastian)',
 'john@curioussymbols.... (John Pye)',
 'loluengo@gmail.... (Lorenzo Luengo)',
 'novin01@gmail.... (Dave Hirschfeld)',
 'D.J.Baker@soton.ac... (Baker D.J.)',
 'helias@bccn.uni-freiburg... (Moritz Helias)',
 'anthony.seward at ieee.org (Anthony Joseph Seward)',
 'rkern at UCSD.EDU (Robert Kern)',
 'zunzun@zunzun.... (James Phillips)',
 'stuart@mumford.me... (Stuart Mumford)',
 'zachary.pincus@yale.... (Zachary Pincus)',
 'answer@tnoo.... (Martin =?iso-8859-1?Q?L=FCthi?=)',
 'werner.ho@gmx... (Werner Hoch)',
 'lkb.teichmann@gmail.... (Martin Teichmann)',
 'argriffi@ncsu.... (Alex Griffing)',
 'dmorrill at scipy.org (David C. Morrill)',
 'andorxor at gmx.de (Stephan Tolksdorf)',
 'drewdemento at yahoo.com (Andrew D)',
 'magetter@gmx... (Dominik Maxein)',
 'as8ca@virginia.... (Alok Singhal)',
 'ej at ee.duke.edu (eric jones)',
 'hoel at gl-group.com (=?iso-8859-15?Q?Berthold_H=F6llmann?=)',
 'sinclaird@ukzn.ac... (Scott Sinclair)',
 'fishbrain at crackz.i989.net (fishbrain at crackz.i989.net)',
 'jonathan.taylor at stanford.edu (Jonathan Taylor)',
 'valentine.c@husky.neu.... (Clint Valentine)',
 'taketu at lapis.plala.or.jp (=?ISO-2022-JP?B?GyRCJCo2YkJfJDckXiQ5GyhC?=)',
 'loredo@astro.cornell.... (Tom Loredo)',
 'evgeny.burovskiy@gmail.... (Evgeni Burovski)',
 'russellcecala at netscape.net (Russell Cecala)',
 'timvictor@gmail.... (Tim Victor)',
 'berthold.hoellmann at gl-group.com (=?iso-8859-15?Q?_Berthold_H=F6llmann?=)',
 'Nicolas.Rougier@inria... (Nicolas Rougier)',
 'hofsaess@ifb.uni-stuttgart... (=?ISO-8859-15?Q?Martin_Hofs=E4=DF?=)',
 'martin.hoefling at gmx.de (Martin =?iso-8859-1?q?H=F6fling?=)',
 'Norman.Shelley at motorola.com (Norman Shelley)',
 'djpine@gmail.... (David J Pine)',
 'cl9n8gao2ui at topprodsource.com (Liz Jordan)',
 'vel.accel at gmail.com (dHering)',
 'befelix@ethz... (Felix Berkenkamp)',
 'Marketinger at yaomail.vicp.net (Judy)',
 'm.hofsaess@gmail.... (=?UTF-8?B?TWFydGluIEhvZnPDpMOf?=)',
 'Norbert.Nemec.List@gmx... (Norbet Namec)',
 'jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=)',
 'denis.engemann@gmail.... (Denis A. Engemann)',
 'james.analytis at physics.ox.ac.uk (James G Analytis)',
 'gurunath_s_r at yahoo.co.in (=?iso-8859-1?q?gurunath=20rampur?=)',
 'pearu at cens.ioc.ee (pearu at cens.ioc.ee)',
 'martin.teichmann@mbi-berlin... (Martin Teichmann)',
 'rudolph@ska.ac... (Rudolph van der Merwe)',
 'ns2szu5i6xf at yahoo.ca (Frances Mccullough)',
 'hoytak@gmail.... (Hoyt Koepke)',
 'greenfield at comcast.net (Perry Greenfield)',
 'berthold.hoellmann at gl-group.com (=?ISO-8859-15?Q?Berthold_H=F6llmann?=)',
 'krunk7@gmail.... (James Kyle)',
 'perry@stsci.... (Perry Greenfield)',
 'lpc@cmu.... (Luis Pedro Coelho)',
 'jnl at allegro.mit.edu (J Nicholas Laneman)',
 'mattknox.ca@gmail.... (Matt Knox)',
 'kingdomedu at access-4-free.com (Covenant Academy)',
 'sam.m.birch@gmail.... (Sam Birch)',
 'apps.embedded@gmail.... (Apps Embedded)',
 'twilson at eduplay.com (Tyler W. Wilson)',
 'lguzzardi@gmail.... (luca guzzardi)',
 'fboulogne@sciunto.... (=?ISO-8859-1?Q?Fran=E7ois_Boulogne?=)',
 'hazmat at objectrealms.net (Kapil Thangavelu)',
 'josh@fiveyearitch.... (Josh Fox)',
 'denis-bz-py@t-online... (denis)',
 'abhikshah@gmail.... (Abhik Shah)',
 'scipy@mspacek.mm... (Martin Spacek)',
 'm.hofsaess@gmail.... (=?ISO-8859-1?Q?Martin_Hofs=E4=DF?=)',
 'stefan@sun.ac... (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)',
 'eavventi@yahoo... (Enrico Avventi)',
 'kern at caltech.edu (Robert Kern)',
 'russellcecala at netscape.net (russell cecala)',
 'peter.demarest@gmail.... (Peter Demarest)',
 'pgmdevlist@gmail.... (Pierre GM)',
 'joe@nowsol.... (Joe Covalesky)',
 'promoters at selvacouters.com.br (Marcos)',
 'benfrantzdale@gmail.... (Ben FrantzDale)',
 'fuoss@anl.... (Paul Fuoss)',
 'wagner.nils at vdi.de (My VDI Freemail)',
 'mail@stevesimmons.... (Stephen Simmons)',
 'ognen@enthought.... (Ognen Duzlevski)',
 'freakhouse at rock.com (freakhouse)',
 'slesarev.anton@gmail.... (Anton Slesarev)',
 'mierle@gmail.... (Keir Mierle)',
 'largeddysimulation@gmail.... (Nikola Mirkov)',
 'pivanov314@gmail.... (Paul Ivanov)',
 'Robin.K.Friedrich@usa-spaceops.... (Friedrich, Robin K)',
 'zelbier@gmail.... (Olivier Verdier)',
 'rajsai24@gmail.... (Sai Rajeshwar)',
 'd.l.goldsmith@gmail.... (David Goldsmith)',
 'Car_Quotes at topnotchrewards.com (Car_Quotes at topnotchrewards.com)',
 'k.h.gillen@dundee.ac... (Kenneth Gillen)',
 'nouiz@nouiz.... (=?UTF-8?B?RnLDqWTDqXJpYyBCYXN0aWVu?=)',
 'fishburn at MIT.EDU (Matt Fishburn)',
 'cebizzinetr at centerlink.com.br (Elmer)',
 'aron@ahmadia.... (Aron Ahmadia)',
 'matthewturk@gmail.... (Matthew Turk)',
 'daw@clemson.... (Murray Daw)',
 'bogus@does.not.exist.com ()',
 'russel@appliedminds.... (Russel Howe)',
 'peter.skomoroch@gmail.... (Peter Skomoroch)',
 'aza at uchicago.edu (Aza Raskin)',
 'eric at enthought.com (eric)',
 'rosaliehayes_nm at australiandomainregistration.com (Rosalie Hayes)',
 'Laurence.Viry@imag... (Laurence Viry)',
 'jtravs at gmail.com (John Travers)',
 'bertle at smoerz.org (Roman Bertle)',
 'eki102 at mail.com (EKI OLUWA)',
 'andreamilazzo@mancausoft.... (Mancausoft)',
 'larson.eric.d@gmail.... (Eric Larson)',
 'paul.anton.letnes@gmail.... (Paul Anton Letnes)',
 'drizzd@aon... (Clemens Buchacher)',
 'hagberg@lanl.... (Aric Hagberg)',
 'harkal at sylphis3d.com (Harry Kalogirou)',
 'jackliddlephysics@googlemail.... (Jack Liddle)',
 'survinderpal@gmail.... (survinder pal)',
 'turian@gmail.... (Joseph Turian)',
 'ewm@redtetrahedron.... (Eric Moore)',
 'prabhu at aero.iitb.ac.in (Prabhu Ramachandran)',
 'rowen@uw.... (Russell E. Owen)',
 'jmobutu457 at rediffmail.com (joseph mobutu)',
 'arnd.baecker at physik.uni-ulm.de (arnd.baecker at physik.uni-ulm.de)',
 'rogerlew@gmail.... (Roger Lew)',
 'falted at pytables.org (Francesc Alted)',
 '236626 at delphi.com (236626 at delphi.com)',
 'mforbes at alum.MIT.EDU (Michael Forbes)',
 'rhys.ulerich@gmail.... (Rhys Ulerich)',
 'tim.leslie at gmail.com (Tim Leslie)',
 'eric at scipy.org (eric)',
 'jrjohansson@gmai... (jrjohansson@gmai...)',
 'michael.abshoff@googlemail.... (Michael Abshoff)',
 'patmiller at llnl.gov (Patrick Miller)',
 'datafeed at SoftHome.net (M. Evans)',
 'mani.sabri@gmail.... (mani sabri)',
 'johannes.kulick@ipvs.uni-stuttgart... (Johannes Kulick)',
 'jba@SDF.LONESTAR.... (Jeffrey Armstrong)',
 'gareth.elston.floss@googlemail.... (Gareth Elston)',
 'fnoe at gmx.net (Frank Noe)',
 'pierre.haessig@crans.... (Pierre Haesssig)',
 'numberschristian_qv at 3com.com (Numbers Christian)',
 'andrew@andrewschein.... (Andrew Schein)',
 'beamesleach@gmail.... (Alex Leach)',
 'tjlahey at eon.uwaterloo.ca (tjlahey)',
 'jantod@gmail.... (Janto Dreijer)',
 'jjhelmus@gmail.... (Jonathan Helmus)',
 'eric@enthought.... (eric jones)',
 'robert.kern@gmai... (robert.kern@gmai...)',
 ...]

In [52]:
people_dict = []

In [53]:
for person in people_list:
    n_post = list(arx.data['From']).count(person)
    n_replies = list(arx.data[arx.data['In-Reply-To'] > 0]['From']).count(person)
    proportion = float(n_replies) / n_post
    people_dict.append({'name': person, '# of posts': n_post, '# of replies': n_replies, 'proportion': proportion})

In [54]:
new_frame = pd.DataFrame(people_dict)

From here we try to find correlation between # of messages by initializer of each thread and # of messages by responders of each thread


In [55]:
a_list = arx.data[arx.data['In-Reply-To'].apply(lambda x: x == None)]

In [56]:
ab_list = []

In [57]:
for i in range(len(a_list)):
    a = a_list.irow(i)
    b_list = arx.data[arx.data['In-Reply-To'] == a.name]
    for j in range(len(b_list)):
        b = b_list.irow(j)
        ab_list.append(((a['From'], a['Date']), (b['From'], b['Date'])))

In [58]:
ab_list[:100]


Out[58]:
[(('jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-09-27 14:26:13', tz=None)),
  ('jason at tishler.net (Jason Tishler)',
   Timestamp('2001-09-27 20:48:48', tz=None))),
 (('jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-09-27 14:26:13', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-09-28 14:48:13', tz=None))),
 (('python at lee-morgan.net (python at lee-morgan.net)',
   Timestamp('2001-10-11 15:14:33', tz=None)),
  ('travis at scipy.org (Travis N. Vaught)',
   Timestamp('2001-10-11 15:22:01', tz=None))),
 (('altis at semi-retired.com (Kevin Altis)',
   Timestamp('2001-10-15 17:39:46', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-10-25 03:08:15', tz=None))),
 (('ejones17 at austin.rr.com (ERIC JONES)',
   Timestamp('2001-10-23 15:29:38', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-10-24 01:17:57', tz=None))),
 (('ejones17 at austin.rr.com (eric jones)',
   Timestamp('2001-10-24 21:19:03', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-10-25 04:27:11', tz=None))),
 (('ejones17 at austin.rr.com (eric jones)',
   Timestamp('2001-10-25 13:27:37', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-10-26 03:26:14', tz=None))),
 (('eric at scipy.org (eric jones)',
   Timestamp('2001-10-26 15:27:20', tz=None)),
  ('jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-10-26 19:32:53', tz=None))),
 (('jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-10-29 20:36:02', tz=None)),
  ('travis at scipy.org (Travis N. Vaught)',
   Timestamp('2001-10-29 21:00:09', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2001-11-05 14:55:33', tz=None)),
  ('rossini at blindglobe.net (A.J. Rossini)',
   Timestamp('2001-11-05 16:51:42', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2001-11-05 14:55:33', tz=None)),
  ('strang at nmr.mgh.harvard.edu (Gary Strangman)',
   Timestamp('2001-11-05 18:15:09', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2001-11-05 15:07:04', tz=None)),
  ('magnus at thinkware.se (Magnus =?iso-8859-1?Q?Lyck=E5?=)',
   Timestamp('2001-11-06 11:40:53', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2001-11-06 15:05:36', tz=None)),
  ('magnus at thinkware.se (Magnus =?iso-8859-1?Q?Lyck=E5?=)',
   Timestamp('2001-11-09 12:35:46', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2001-11-09 15:12:20', tz=None)),
  ('magnus at thinkware.se (Magnus =?iso-8859-1?Q?Lyck=E5?=)',
   Timestamp('2001-11-09 17:12:30', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2001-11-09 15:12:20', tz=None)),
  ('jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-11-09 20:42:15', tz=None))),
 (('rlytle at tqs.com (Lytle, Robert TQO)',
   Timestamp('2001-11-09 20:50:38', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-11-10 02:14:42', tz=None))),
 (('arnd.baecker at physik.uni-ulm.de (arnd.baecker at physik.uni-ulm.de)',
   Timestamp('2001-11-21 10:12:58', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2001-11-28 01:43:36', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-02 23:00:36', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-03 03:11:59', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-03 05:23:15', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-03 07:26:10', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-03 05:23:15', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-03 09:31:24', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-04 21:48:49', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-04 23:16:40', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-04 21:48:49', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-04 23:43:13', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-04 21:48:49', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-05 17:45:20', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-04 23:10:01', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-05 00:30:21', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-05 04:46:11', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-05 08:31:28', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-05 04:46:11', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-05 08:52:05', tz=None))),
 (('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-05 12:30:32', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-05 17:50:10', tz=None))),
 (('rob at pythonemproject.com (Rob)',
   Timestamp('2002-01-05 16:08:20', tz=None)),
  ('travis at scipy.org (Travis N. Vaught)',
   Timestamp('2002-01-05 17:08:02', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-05 17:15:03', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-05 18:28:01', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-07 06:48:54', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-07 08:44:42', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-07 14:48:07', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-07 16:08:37', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-07 15:16:40', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-07 20:13:25', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-09 22:46:09', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-10 08:39:25', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-09 22:46:09', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-10 12:06:13', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-10 14:37:17', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-10 16:19:40', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-10 15:57:07', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-10 17:22:43', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-11 17:50:43', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-12 03:57:40', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-11 17:50:43', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-12 09:34:31', tz=None))),
 (('rob at pythonemproject.com (Rob)',
   Timestamp('2002-01-12 15:39:37', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-12 17:38:30', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-12 20:20:06', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-12 23:30:32', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-12 20:20:06', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-13 08:37:11', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-12 21:03:59', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-13 04:31:52', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-13 02:55:44', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-13 08:04:03', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-13 02:55:44', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-13 09:45:32', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-13 02:55:44', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-13 23:13:25', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-14 03:01:03', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-14 17:18:13', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-14 07:01:12', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-14 09:15:18', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-14 15:21:11', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-14 16:51:26', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-14 15:35:38', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-14 17:00:28', tz=None))),
 (('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-14 16:10:28', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-14 19:19:02', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-14 16:13:10', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-14 17:24:13', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-14 18:01:05', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-14 19:05:42', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-14 18:01:05', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-14 19:17:43', tz=None))),
 (('oliphant.travis at ieee.org (Travis Oliphant)',
   Timestamp('2002-01-14 23:35:02', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-15 02:00:33', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-16 18:51:00', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-16 20:31:45', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-18 07:07:40', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-18 09:51:22', tz=None))),
 (('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-18 10:39:54', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-19 00:04:30', tz=None))),
 (('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-18 10:39:54', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-19 18:07:39', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-18 17:23:37', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-18 20:14:17', tz=None))),
 (('jochen at unc.edu (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2002-01-18 20:22:59', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-19 03:39:24', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-19 19:10:53', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-19 21:02:44', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-19 19:57:15', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-19 21:13:57', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-19 20:02:09', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-20 04:15:57', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-21 05:08:44', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2002-01-22 03:06:33', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-22 04:58:57', tz=None)),
  ('jochen at jochen-kuepper.de (Jochen =?iso-8859-1?q?K=FCpper?=)',
   Timestamp('2002-01-22 06:46:33', tz=None))),
 (('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-22 21:00:39', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-23 08:17:30', tz=None))),
 (('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-01-22 21:00:39', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-04 23:37:58', tz=None))),
 (('andreas.frankfurt at gmx.de (Andreas Lorenz)',
   Timestamp('2002-01-24 13:54:08', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-24 17:19:51', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-24 17:57:56', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-01-25 02:50:54', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-01-30 17:58:59', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-01-30 19:12:45', tz=None))),
 (('loredo at astrosun.astro.cornell.edu (Tom Loredo)',
   Timestamp('2002-02-06 21:00:04', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-07 10:08:43', tz=None))),
 (('pnmiller at pacbell.net (Pat Miller)',
   Timestamp('2002-02-08 19:38:00', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-02-10 04:36:34', tz=None))),
 (('pnmiller at pacbell.net (Pat Miller)',
   Timestamp('2002-02-08 19:38:00', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-11 19:22:59', tz=None))),
 (('pnmiller at pacbell.net (Pat Miller)',
   Timestamp('2002-02-08 19:38:02', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-11 18:53:07', tz=None))),
 (('pnmiller at pacbell.net (Pat Miller)',
   Timestamp('2002-02-10 09:51:46', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-02-11 17:46:40', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-11 20:22:45', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-02-11 21:35:41', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-11 20:22:45', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-11 22:45:39', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-11 23:03:18', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-12 10:57:09', tz=None))),
 (('pnmiller at pacbell.net (Pat Miller)',
   Timestamp('2002-02-12 07:32:16', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-02-16 17:56:48', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-12 17:06:30', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-12 19:58:52', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-13 08:20:40', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-13 12:14:06', tz=None))),
 (('heiko at hhenkelmann.de (Heiko Henkelmann)',
   Timestamp('2002-02-13 08:50:07', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-13 09:09:21', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-13 17:05:26', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-13 22:29:20', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-13 17:44:38', tz=None)),
  ('oliphant at ee.byu.edu (Travis Oliphant)',
   Timestamp('2002-02-13 18:54:59', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 07:10:42', tz=None)),
  ('peterson at math.utwente.nl (Pearu Peterson)',
   Timestamp('2002-02-14 08:28:46', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 07:10:42', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-02-14 08:32:19', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 07:33:14', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-02-14 08:42:51', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 07:33:14', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-14 08:43:54', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 09:34:49', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-14 12:16:45', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 19:49:42', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-14 20:59:08', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 19:49:42', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-14 21:29:01', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-14 20:32:58', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-14 23:32:13', tz=None))),
 (('heiko at hhenkelmann.de (Heiko Henkelmann)',
   Timestamp('2002-02-14 20:43:24', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-14 20:47:01', tz=None))),
 (('loredo at astrosun.astro.cornell.edu (Tom Loredo)',
   Timestamp('2002-02-14 21:15:16', tz=None)),
  ('a.schmolck at gmx.net (A.Schmolck)',
   Timestamp('2002-02-15 17:58:23', tz=None))),
 (('jwp at cns.nyu.edu (Jon Peirce)',
   Timestamp('2002-02-15 14:30:20', tz=None)),
  ('prabhu at aero.iitm.ernet.in (Prabhu Ramachandran)',
   Timestamp('2002-02-15 17:24:53', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-15 19:53:28', tz=None)),
  ('oliphant.travis at ieee.org (Travis Oliphant)',
   Timestamp('2002-02-16 03:08:05', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-15 20:04:50', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-02-15 21:15:30', tz=None))),
 (('eric at scipy.org (eric)', Timestamp('2002-02-15 20:04:50', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-15 21:27:57', tz=None))),
 (('heiko at hhenkelmann.de (Heiko Henkelmann)',
   Timestamp('2002-02-15 20:49:49', tz=None)),
  ('pearu at cens.ioc.ee (Pearu Peterson)',
   Timestamp('2002-02-15 20:56:53', tz=None))),
 (('heiko at hhenkelmann.de (Heiko Henkelmann)',
   Timestamp('2002-02-15 21:33:49', tz=None)),
  ('fperez at pizero.colorado.edu (=?ISO-8859-1?Q?Fernando_P=E9rez?=)',
   Timestamp('2002-02-15 21:46:19', tz=None)))]

In [59]:
type(ab_list[0][0][1])


Out[59]:
pandas.tslib.Timestamp

In [60]:
def getNumReplies(author, time):
    filteredByAuthor = arx.data[arx.data['From'] == author]
    filteredByTime = filteredByAuthor[filteredByAuthor['Date'] <= time]
    return len(filteredByTime)

In [61]:
new_list = map(lambda x: (getNumReplies(x[0][0], x[0][1]), getNumReplies(x[1][0], x[1][1])), ab_list)

In [62]:
x = []
y = []

In [63]:
for elem in new_list:
    x.append(elem[0])
    y.append(elem[1])

In [64]:
plt.scatter(x,y)


Out[64]:
<matplotlib.collections.PathCollection at 0x10c08ca50>

In [65]:
ab_list2 = set()
for i in range(len(a_list)):
    a = a_list.irow(i)
    b_list = arx.data[arx.data['In-Reply-To'] == a.name]
    for j in range(len(b_list)):
        b = b_list.irow(j)
        ab_list2.add((a['From'], b['From']))
ab_list2 = list(ab_list2)

In [66]:
def getNumReplies2(author):
    filteredByAuthor = arx.data[arx.data['From'] == author]
    return len(filteredByAuthor)

In [67]:
new_list2 = map(lambda x: (getNumReplies2(x[0]) , getNumReplies2(x[1])), ab_list2)

In [68]:
x = []
y = []
for elem in new_list:
    x.append(elem[0])
    y.append(elem[1])
plt.scatter(x,y)


Out[68]:
<matplotlib.collections.PathCollection at 0x10c2e3c50>

In [68]: